Transduction of recombinases for inducible gene targeting

The present invention provides the use of a fusion protein comprising a 
site-specific DNA recombinase domain and a protein transduction domain 
for preparing an agent for inducing target gene alteration in a living 
organism or in cultured cells, suitable fusion proteins and a method for the 
production of said fusion proteins. 

Background 

For some years targeted mutagenesis in totipotent mouse embryonic stem 
(ES) cells has been used to inactivate genes, for which cloned sequences 
were available (Capecchi, Trends in Genetics 5, 70 - 76 (1989)). Since ES 
cells can pass mutations induced in vitro to transgenic offspring in vivo, it 
is possible to analyze the consequences of gene disruption in the context 
of the entire organism. Thus, numerous mouse strains with functionally 
inactivated genes ("knock out mice") have been created by this 
technology and utilized to study the biological function of a variety of 
genes. 

A refined method of targeted mutagenesis, referred to as conditional 
mutagenesis, employs a site-specific recombination system (e.g. Cre/loxP 
or Flp/frt - Sauer and Henderson, Proc. Natl. Acad. Sci. USA 85, 5166-
5170 (1988); Senecoff et al., J. Mol. Biol., 201, 405 - 421 (1988)) which
enables a temporally and/or spatially restricted alteration of target genes
(Rajewsky et al., J. Clin. Invest. 98, 600 - 603 (1996)). The creation of
conditional mouse mutants requires the generation of two mouse strains, 
i.e. the recombinase recognition strain and the recombinase expressing 
strain. The recombinase recognition strain is generated by homologous 
recombination in ES cells as described above except that the targeted
exon(s) is (are) flanked by two recombinase recognition sequences
(hereinafter "RRS"; e.g. loxP or frt). The type of recombination event 
mediated by the recombinase depends on the disposition of the RRS, with 
deletions, inversions, translocations and integrations being possible 
(Torres and Kuhn, Oxford University Press, Oxford, New York (1997)). By 
placing the RRS into introns, an interference with gene expression before 
recombination can be avoided. The recombinase expressing strain 
contains a recombinase transgene (e.g. Cre, Flp) whose expression is 
either restricted to certain cells and tissues or is inducible by external 
agents. Crossing of the recombinase recognition strain with the
recombinase expressing strain leads to recombination of the RRS-flanked
exons in the doubly transgenic offspring in a prespecified temporally and/or
spatially restricted manner. Thus, the method allows the temporal analysis
of gene function in particular cells and tissues of otherwise widely 
expressed genes. Moreover, it enables the analysis of gene function in the 
adult organism by circumventing embryonic lethality which is frequently 
the consequence of gene mutation. For pharmaceutical research, aiming 
to validate the utility of genes and their products as targets for drug 
development, inducible mutations provide an excellent genetic tool. 
However, the current systems for inducible recombinase expression in 
transgenic animals suffer from a certain degree of leakiness in the 
absence of the inducer (Kuhn et al., Science 269(5229): 1427-9 (1995); 
Schwenk et al., Nucleic Acids Res. 26(6): 1427-32 (1998)). Furthermore,
the generation of conditional mutants is a time-consuming and
labor-intensive procedure, since the recombinase recognition strain and the
recombinase expressing strain have to be bred over at least two
generations in order to obtain animals carrying both the recombinase
transgene and two copies of the RRS-flanked target gene sequence.

Protein transduction domains (hereinafter referred to as "PTD") that
have the ability to cross cell membranes were identified, e.g. in the
Antennapedia protein from Drosophila (Vives et al., J. Biol. Chem.
272(25):16010-7 (1997)), Kaposi fibroblast growth factor (Kaposi FGF;
Lin et al., J. Biol. Chem. 270: 14255-58 (1995)), VP22 from HSV (Elliott
and O'Hare, Cell, 88(2):223-33 (1997)) and TAT from HIV (Green and
Loewenstein, Cell, 55(6):1179-88 (1988); Frankel and Pabo, Cell,
55(6):1189-93 (1988)). WO 99/29721 moreover mentions TAT mutants
having an enhanced activity as compared to the wild-type peptide.
Fusion of PTDs to heterologous proteins conferred the ability to transduce
into cultured cells (Fawell et al., Proc. Natl. Acad. Sci. USA, 91(2):664-8
(1994); Elliott and O'Hare (1997); Phelan et al., Nature Biotech. 16: 440-
443 (1998) and Dilber et al., Gene Ther., 6(1): 12-21 (1999)). Dalby and
Bennett showed that a fusion protein consisting of VP22 and functional Flp 
recombinase translocated between cells in culture (from COS-1 cells 
transfected with VP22-Flp to CHO cells carrying Flp recognition sites (FRT 
sites); see Dalby and Bennett, Invitrogen, Expressions 6.2, page 13 
(1999)). Further, WO 99/11809 mentions a fusion protein Antp-Cre and
emphasizes that it may be used to deliver Cre into the cell, where it
recombines inside the cell nucleus. It is mentioned that the fusion protein
is suitable for manipulating genomic DNA at precise locations in a
temporally regulated manner.

Furthermore, a recent report demonstrated that the β-galactosidase
protein fused to the 11 amino acids PTD from the HIV TAT protein can 
infiltrate all tissues of living mice reaching every single cell (Schwarze et 
al., Science, 285(5433): 1569-72 (1999)). Finally, WO 99/60142 discloses 
vector constructs for gene therapy carrying a tumor cell sensitizing gene, 
a sensitizing gene expression regulatory system, a control gene and a 
control gene expression regulatory system, wherein the control gene can 
be a fusion gene consisting of a recombinase (viz. Cre or Flp) and a 
trafficking protein (viz. VP22).

With regard to the fusion protein Antp-Cre of WO 99/11809, it is, however,
general knowledge in the art that the Antennapedia PTD is not a generally 
applicable transducing protein, namely it has only a limited activity with 
proteins having more than 100 amino acid residues (Derossi et al., Trends 
Cell Biol. 8: 84-87, 1998). In view of the limited transducing activity of the 
Antp PTD and the size of the generally known recombinases (ranging from 
about 200 to about 600 amino acid residues), it was desirable to provide a 
more potent system for the transduction of recombinases. It was, 
however, not clear for a person skilled in the art whether PTDs would be 
effective at all with recombinases for the following reasons: 

(i) only a single example of PTD-mediated delivery of proteins (above 100 
amino acid residues) in vivo has been reported so far (Schwarze et al., 
Science, 285(5433): 1569-72 (1999); Fawell et al., PNAS, 91: 664-68 
(1994); both references describing the TAT-mediated transduction of
β-galactosidase in mice);

(ii) it is known that - due to unfolding and refolding processes - the
transduction of native proteins into cells may result in a significant loss of
protein activity (e.g., as described for TAT-GFP; Schwarze et al., Trends
Cell Biol. 10: 290-95 (2000));

(iii) neither the number of protein molecules that can be transferred into a 
cell by a given translocation domain has been systematically determined, 
nor the number of Cre molecules in the cell nucleus that is required for 
efficient recombination; 

(iv) the delivery of active proteins requires unfolding and proper refolding,
which is unpredictable for a given protein (Bonifaci et al., AIDS 9:
995-1000 (1995)); and

(v) the mechanism by which protein transduction domains facilitate
protein transduction is unknown, and several findings have been published
that rule out classical receptor-, transporter-, endosome- or endocytosis-
mediated processes in the transduction of Antp, TAT and VP22 (G. Elliott, P.
O'Hare, Cell 88, 223-233 (1997); D.A. Mann, A.D. Frankel, EMBO J. 10,
1733-1739 (1991); D. Derossi et al., J. Biol. Chem. 269, 10444-10450
(1994); D. Derossi et al., J. Biol. Chem. 271, 18188-18193 (1996); E.
Vives et al., J. Biol. Chem. 272, 16010-16017 (1997)).

Moreover, there was still the need for a generally applicable method where 
the genetic manipulation can be performed in both endogenous genes
and transgenes. 

Summary of the Invention 

It was found that site-specific DNA recombinase proteins can be 
translocated into cells of a living organism when fused to specific protein 
transduction domains, namely transduction domains being derived from 
the VP22 protein of HSV or from the TAT protein of HIV. Thus, whenever a 
gene mutation is desired, recombination is induced upon the injection of 
the appropriate site-specific recombinase fused to a transduction domain 
into such a living organism (provided, however, that said organism carries 
at least one appropriate RRS integrated in the genome). 

The present invention thus provides 

(1) the use of a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a protein transduction domain (PTD) 

for preparing an agent for inducing target gene alterations in a living 
organism or cell culture, wherein said living organism carries at least one 
or more recognition sites for said site-specific DNA recombinase integrated 
in its genome; 

(2) a method for inducing gene alterations in a living organism which 
comprises administering to said living organism a fusion protein 
comprising a site-specific DNA recombinase domain and a PTD as defined 
in (1) above, wherein said living organism carries at least one or more
recognition sites for said site-specific DNA recombinase integrated in its
genome; 

(3) a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a PTD being derived from the VP22 protein of HSV or from the TAT 
protein of HIV 

provided that when the site-specific DNA recombinase domain is wild-type 
Cre or Flp then the PTD is not the full length VP22 PTD of HSV (i.e., the 
fusion protein is not identical to the fusion protein of Dalby and Bennett, 
Invitrogen, Expressions 6.2, page 13 (1999) and of WO 99/60142); 

(4) a DNA sequence coding for the fusion protein of (3) above; 

(5) a vector comprising the DNA sequence as defined in (4) above; 

(6) a host cell transformed with the vector of (5) above and/or comprising 
the DNA of (4) above; 

(7) a method for producing the fusion protein of (1) above which 
comprises culturing the transformed host cell of (6) above and isolating 
the fusion protein; and 

(8) an injectable composition comprising the fusion protein as defined in 
(1) or (3) above. 

The invention is further illustrated by the appended Figures and is 
explained in detail below. 

Description of the Figures 

Fig. 1: Generation of induced mouse mutants using purified fusion
proteins. 

A: Expression of the fusion protein consisting of the site-specific DNA 
recombinase (e.g. Cre) and the protein transduction domain (e.g. the HIV 
derived TAT peptide) in prokaryotic or eukaryotic cells. 
B: Extraction and purification of the expressed fusion protein (e.g. as 
described in Nagahara et al., Nat. Med. 4(12): 1449-52 (1998)).
C: Injection of the purified fusion protein into mice carrying the RRS-
flanked target sequence.

D: Analysis of the pattern of induced target gene recombination and the 
resulting phenotype. 
Triangle: RRS. 

Fig. 2: Scheme of the bacterial expression vector pT7-TACS (SEQ ID
NO:16). The coding region of the 11 amino acid protein transduction
domain of HIV TAT protein is fused to the N-terminus of the Cre
recombinase protein sequence. The 10-amino-acid strep tag and the
protease factor Xa recognition sequence are fused to the C-terminus. The
T7 promoter permits expression of TAT-Cre protein in E. coli.

Fig. 3: Detection of purified TAT-Cre protein by Coomassie staining and
Western blot analysis. 

A: Coomassie stained SDS-PAGE gel. Lane 1: 10 kDa ladder (Life 
Technologies, Cat. No.: 10064-012), 2: 1000 ng BSA, 3: 750 ng BSA, 4: 
500 ng BSA, 5: 100 ng BSA, 6: 50 ng BSA, 7: 5 µl TAT-Cre, 8: 1 µl TAT-
Cre in Bicine buffer.

B: Western blot analysis using an alkaline phosphatase-conjugated anti- 
strep tag antibody (IBA, Cat. No: 2-1503-001). Lane 1: MultiMark 
(Invitrogen, Cat. No.: LC5725), 2: 7 µl TAT-Cre, 3: 5 µl TAT-Cre, 4: 2.5 µl
TAT-Cre, 5: 1.25 µl TAT-Cre in Bicine buffer.

Fig. 4: X-Gal staining of M5Pax8 cells treated with TAT-Cre protein.
M5Pax8 fibroblasts were treated for 18 h with 3.5 (A), 6.9 (B) and 13.8
µg/ml TAT-Cre protein (C) in serum-free medium. Four days after
treatment, cells were fixed and stained with X-Gal.

Fig. 5: Measurement of β-galactosidase activity in cell lysates. M5Pax8
fibroblasts were treated for 18 h with increasing concentrations of TAT-
Cre, as indicated, or transiently transfected with expression vectors
for either Cre (pCMV-I-Cre-pA, see SEQ ID NO: 29) or β-galactosidase
(pCMV-I-β-pA, see SEQ ID NO:30). Four days after treatment, cells were
lysed and the β-galactosidase activities were determined.

Fig. 6: PCR detection of TAT-Cre mediated recombination in mice.
A: PCR-analysis of genomic DNA from duodenum (lane 2), liver (3),
kidney (4), spleen (5), muscle (6), lung (7), tail (8) and brain (9) of a
plnl3 mouse treated three times with intraperitoneal injections of 75 µg
TAT-Cre protein at two-day intervals. Deletion of the loxP-flanked DNA
segment is indicated by the presence of the about 400 bp fragment. Lane
1: 1 kb ladder (Life Technologies).

B: PCR strategy to detect Cre-mediated deletion of the loxP-flanked DNA
segment. Arrows indicate the positions of the primers.
C: PCR-analysis of genomic DNA from spleen of a plnl3 mouse treated
three times with intraperitoneal injections of 75 µg TAT-Cre protein at
two-day intervals (lane 4). To confirm the presence of the BamH I restriction
site, the PCR product was digested with BamH I which produces two
diagnostic fragments of about 190 and about 210 bp (5). As a control, tail
DNA from untreated mice carrying the loxP-flanked (lane 2) and the
deleted plnl3 allele (3) was subjected to PCR amplification. Lane 1: 100
bp ladder (Life Technologies), lane 6: 1 kb ladder (Life Technologies).

Fig. 7: Scheme of the bacterial expression vectors pT7-VPCS (SEQ ID 
NO: 17) and pCRT7-AVPCS (SEQ ID NO: 15). The coding region of the 301 
amino acid protein transduction domain of HSV VP22 protein (A) or the 
truncated 143 amino acid AVP22 domain (B) is fused to the N-terminus of 
the Cre recombinase protein sequence. The 10-amino-acid strep tag and
the protease factor Xa recognition sequence are fused to the C-terminus. 
The T7 promoter allows the expression of VP22-Cre and AVP22-Cre fusion 
proteins in E. coli. The sequence in pCRT7-AVPCS encoding the 15 amino
acid N-terminal leader sequence is used for enhanced protein stability
(Invitrogen). 

Fig. 8: Detection of the purified VP22-Cre and AVP22-Cre fusion proteins
by Coomassie staining and Western blot analysis. 
A: Detection of VP22-Cre protein in a Coomassie-stained SDS-PAGE gel. 
Lane 1: 10 kDa ladder, 2: 1000 ng BSA, 3: 500 ng BSA, 4: 100 ng BSA, 
5: inclusion body protein extract before chromatography, 6: unbound 
protein, 7: fraction 17, 8: fraction 18, 9: fraction 19, 10: fraction 20. The 
position of the 75 kDa VP22-Cre protein is indicated by the arrow head. 
B: Detection of VP22-Cre protein by Western blot analysis using an 
alkaline phosphatase-conjugated anti-strep tag antibody (IBA, Cat. No.: 2- 
1503-001). Lane 1: MultiMark (Invitrogen), 2: inclusion body protein 
extract before chromatography, 3: unbound protein, 4: fraction 10, 5: 
fraction 11, 5: fraction 16, 6: fraction 17, 7: fraction 18, 8: fraction 19, 9: 
fraction 19, 10: fraction 20. 

C: Detection of AVP22-Cre protein in a Coomassie-stained SDS-PAGE gel. 
Lane 1: 10 kDa ladder, 2: inclusion body protein extract before 
chromatography, 3: unbound protein, 4: fraction 1, 5: fraction 8, 6: 
fraction 9, 7: fraction 15, 8: 100 ng BSA, 9: 500 ng BSA, 10: 1000 ng 
BSA. The position of the 60 kDa AVP22-Cre protein is indicated by the 
arrow head. 

D: Detection of AVP22-Cre protein by Western blot analysis using an
alkaline phosphatase-conjugated anti-strep tag antibody (IBA, Cat. No.: 2- 
1503-001). Lane 1: MultiMark (Invitrogen), 2: inclusion body protein 
extract before chromatography, 3: unbound protein, 4: fraction 4, 5: 
fraction 8, 6: fraction 10, 7: fraction 12, 8: soluble protein extract before 
chromatography, 9: unbound protein, 10: fraction 7. 



Fig. 9: X-Gal staining of M5Pax8 cells treated with VP22-Cre and AVP22-
Cre fusion proteins. M5Pax8 fibroblasts were treated for 18 h with either
Bicine buffer (A), 0.5 µg/ml VP22-Cre (B) or 3.75 µg/ml AVP22-Cre (C) in
serum-free medium. Four days after treatment, cells were fixed and
stained with X-Gal.

Fig. 10: Measurement of β-galactosidase activity in cell lysates. M5Pax8
fibroblasts were treated for 18 h with VP22-Cre, AVP22-Cre or Bicine
buffer alone, as indicated, or transiently transfected with expression
vectors for Cre (pCMV-I-Cre-pA, see SEQ ID NO: 29) or β-galactosidase
(pCMV-I-β-pA, see SEQ ID NO:30). Four days after treatment, cells were
lysed and the β-galactosidase activities were determined.

Fig. 11: PCR detection of Cre-mediated recombination in cells treated with
VP22-Cre and AVP22-Cre fusion proteins (shown in SEQ ID NOs: 21 and
14, respectively).

A: PCR-analysis of genomic DNA isolated from M5Pax8 fibroblasts. Cells 
were transiently transfected with a Cre expression vector (lane 2) or 
treated for 18 h with either buffer alone (lane 3), 7.5 µg/ml VP22-Cre (4,
5) or 15 µg/ml AVP22-Cre (6, 7) in serum-free medium. Four days after
treatment, genomic DNA was extracted and subjected to PCR
amplification. Deletion of the loxP-flanked DNA segment is indicated by the
presence of the 226 bp DNA fragment. To confirm the presence of the Nco
I restriction site in the recombined allele, the PCR products were digested
with Nco I which produces two diagnostic fragments of 85 bp and 141 bp
(lanes 5 and 7). Lane 1: 100 bp ladder (Life Technologies), lane 8: 1 kb
ladder (Life Technologies). 

B: PCR strategy to detect Cre-mediated deletion of the loxP-flanked DNA
segment. Arrows indicate the positions of the primers. 
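
The diagnostic digests described for Fig. 6 C and Fig. 11 A can be checked arithmetically: the fragments produced by a single cut should add up to the undigested PCR product. The following is a minimal sketch of that check; the function name and the 20 bp tolerance applied to the approximate sizes of Fig. 6 C are illustrative assumptions and are not part of the described experiments.

```python
def fragments_match_amplicon(amplicon_bp, fragment_sizes_bp, tolerance_bp=0):
    """Return True if the digest fragments add up to the undigested amplicon.

    tolerance_bp allows for sizes that are only given approximately.
    """
    return abs(sum(fragment_sizes_bp) - amplicon_bp) <= tolerance_bp


# Fig. 11 A: 226 bp product, Nco I fragments of 85 bp and 141 bp.
nco_ok = fragments_match_amplicon(226, [85, 141])

# Fig. 6 C: product of about 400 bp, BamH I fragments of about 190 and 210 bp.
# The tolerance is an assumed value because the sizes are approximate.
bamh_ok = fragments_match_amplicon(400, [190, 210], tolerance_bp=20)

print(nco_ok, bamh_ok)
```

A mismatch between the sum of the fragments and the product size would point to an incomplete digest or to an unexpected PCR product.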

Detailed Description of the Invention 

The expression "target sequences" according to the present invention
means all kinds of sequences which may be mutated (viz. deleted,
translocated, integrated and/or inverted) by the action of the
recombinase. The number of RRS in the target sequence depends on the
kind of mutation to be performed by the recombinase. For most of the
mutations (especially for deletions and inversions) two RRS are required
which are flanking the sequence to be mutated (deleted or inverted). For
some kinds of integrations only one RRS may be necessary within the 
target sequence. 
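
The dependence of the recombination event on the disposition of the RRS can be summarised in a short sketch. It encodes the generally known behaviour of Cre/loxP-type systems (two sites in the same orientation on one molecule give a deletion, two sites in opposite orientation give an inversion, sites on separate molecules give a translocation or, with a circular donor, an integration). The class, field and function names are hypothetical and serve illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RRS:
    """One recombinase recognition sequence (hypothetical representation)."""
    molecule: str     # label of the DNA molecule carrying the site
    position: int     # coordinate on that molecule, in bp
    orientation: int  # +1 or -1


def predict_outcome(site_a, site_b, donor_is_circular=False):
    """Predict the type of recombination event between two RRS."""
    if site_a.molecule == site_b.molecule:
        # Two sites on the same molecule: orientation decides the outcome.
        if site_a.orientation == site_b.orientation:
            return "deletion"
        return "inversion"
    # Sites on different molecules.
    if donor_is_circular:
        return "integration"
    return "translocation"


# Example: an exon flanked by two RRS in the same orientation.
left = RRS("chromosome", 1000, +1)
right = RRS("chromosome", 2500, +1)
print(predict_outcome(left, right))
```

In this representation, the integration case corresponds to the situation mentioned above in which only one RRS is present within the target sequence and the second site is supplied on the incoming molecule.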

The "living organisms" according to the present invention are multi-cell 
organisms and can be vertebrates such as mammals (e.g., rodents such 
as mice or rats) or non-mammals (e.g., fish) or can be invertebrates such 
as insects or worms, or can be plants (higher plants, algae or fungi). Most
preferred living organisms are mice and fish.

"Cell culture" according to the present invention includes cells isolated from
the above defined living organism and cultured in vitro. These cells can be 
transformed (immortalized) or untransformed (directly derived from the 
living organism; primary cell culture). 

The site-specific DNA recombinase domain within the fusion protein of the
invention of the present application is preferably selected from a
recombinase protein derived from Cre, Flp, φC31 recombinase (Thorpe and
Smith, Proc. Natl. Acad. Sci. USA, vol. 95, 5505-5510 (1998)), γδ
resolvase (Schwickardi and Droge, FEBS letters 471:147-150 (2000)) and
R recombinase (Araki et al., J. Mol. Biol., 182, 191-203 (1985)). The
preferred recombinases are Cre and mutants thereof (preferably the Cre
variant of aa 15 to 357 of SEQ ID NO: 2 or aa 325-667 of SEQ ID NO: 6)
and Flp and variants thereof including Flpe (preferably the Flp variant of
aa 15 to 437 of SEQ ID NO: 4 or aa 325 to 747 of SEQ ID NO: 8).

The protein transduction domain according to the present invention
includes, but is not limited to, the PTDs mentioned in the Background
section. The PTD preferably is derived from the VP22 protein of HSV or
from the TAT protein of HIV. Suitable TAT proteins include, but are not
limited to, proteins comprising (i) the amino acid sequence shown in SEQ
ID NO: 10 and mutants thereof such as (ii) proteins comprising the amino
acid sequence

AGRKKRRQRRR (SEQ ID NO: 22)
YARKARRQARR (SEQ ID NO: 23)
YARAAARQARA (SEQ ID NO: 24)
YARAARRAARR (SEQ ID NO: 25)
YARAARRAARA (SEQ ID NO: 26)
YARRRRRRRRR (SEQ ID NO: 27)
YAAARRRRRRR (SEQ ID NO: 28)

as known from WO 99/29721. Preferred are transduction domains
consisting of the TAT proteins (i) and (ii) above.

Suitable VP22 proteins include, but are not limited to, the wild-type VP22 
protein, i.e., a protein comprising amino acids 1 to 302 of SEQ ID NO: 21,
and truncated forms thereof. Truncated VP22 proteins in accordance with 
the present invention can be those lacking 1 to 158 amino acid residues at 
their N-terminal end. The most preferred VP22 protein is the truncated 
VP22 PTD comprising amino acid residues 16 to 157 of SEQ ID NO: 14. 

The fusion of the two domains of the fusion protein can occur at any 
possible position, i.e., the protein transduction domain can be fused to the 
N- or C-terminus of the site-specific DNA recombinase or can be fused to
active sites within the site-specific DNA recombinase. Preferably the
protein transduction domain is fused to the N-terminus of the site-specific
DNA recombinase domain.

The protein transduction domain can be fused to the site-specific DNA
recombinase either through a direct chemical bond or through a linker 
molecule. Such linker molecule can be any bivalent chemical structure 
capable of linking the two domains. The preferred linker molecule 
according to the present invention is a short peptide, e.g., having 1 to 20, 
preferably 1 to 10, amino acid residues. Specifically preferred short
peptides consist essentially of Gly, Ala and/or Leu.

The fusion protein of the invention of the present application may further 
comprise other functional sequences such as secretion conferring signals, 
nuclear localisation signals and/or signals conferring protein stabilisation. 

In case the fusion protein comprises a protein transduction domain 
derived from the TAT protein of HIV, the DNA sequence coding for said 
fusion protein preferably comprises the sequence 
5' TAC GGC CGC AAG AAG CGC CGC CAA CGC CGC CGC 3'. 

Such a preferred DNA sequence is for instance shown in SEQ ID NO: 11. 
In said sequence the 3' terminal codon ggc codes for the linker Gly. The
DNA sequence of a suitable recombinase may be directly attached to said 
codon ggc. 
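
That the DNA sequence given above encodes the TAT peptide YGRKKRRQRRR (SEQ ID NO: 10) can be verified by translating it codon by codon. The following sketch uses only the five codons of the standard genetic code that occur in this sequence; the function and variable names are illustrative, and the appended GGC codon merely mimics the linker codon described for SEQ ID NO: 11 (the full SEQ ID NO: 11 is not reproduced here).

```python
# Subset of the standard genetic code covering the codons used above.
CODON_TABLE = {
    "TAC": "Y",  # tyrosine
    "GGC": "G",  # glycine
    "CGC": "R",  # arginine
    "AAG": "K",  # lysine
    "CAA": "Q",  # glutamine
}


def translate(dna):
    """Translate a DNA string (spaces allowed) into a one-letter peptide."""
    seq = "".join(dna.upper().split())
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


TAT_DNA = "TAC GGC CGC AAG AAG CGC CGC CAA CGC CGC CGC"

peptide = translate(TAT_DNA)
print(peptide)                     # YGRKKRRQRRR
print(translate(TAT_DNA + " GGC"))  # YGRKKRRQRRRG (TAT peptide plus Gly linker)
```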

The fusion protein can be obtained by the following steps: 

1. Fusion of the recombinase coding region (e.g. encoding Cre: see amino 
acids 15 to 357 of SEQ ID NO: 2) with the sequence conferring protein 
translocation (e.g. the sequence encoding the TAT peptide 
YGRKKRRQRRR, SEQ ID NO: 10) using standard cloning protocols 
(Maniatis et al., Cold Spring Harbor Laboratory, New York (1989)) or 
chemical synthesis.

2. Generation of a construct for the expression of the fusion protein in
prokaryotic or eukaryotic cells, e.g. in E. coli DH5α (Hanahan, J. Mol.
Biol. 166(4):557-80 (1983)) using the QIAexpress pQE vector (Qiagen,
Hilden).

3. Expression of the above mentioned fusion protein in prokaryotic or
eukaryotic cells, e.g. in E. coli DH5α (Hanahan, 1983).

4. Extraction and purification of the above mentioned fusion protein e.g. 
as described in Nagahara et al., Nat. Med., 4(12): 1449-52 (1998).

In an experiment it was shown that TAT-mediated delivery of active Cre 
protein works with sufficient efficacy to facilitate inducible gene targeting 
both in cell lines and living organisms. In this experiment a vector for the 
expression of a TAT-Cre fusion protein in E. coli was constructed, and
TAT-Cre protein was expressed in E. coli and purified from bacterial
lysates. To test the activity of the TAT-Cre protein in vitro, a reporter
cell line that contains a loxP-containing reporter construct was used. This
reporter, when recombined by Cre recombinase, allows the expression of a
β-galactosidase gene. Further, a transgenic mouse strain carrying a loxP-
flanked target was used to investigate the activity of the TAT-Cre protein
in vivo.

In a second experiment it was shown that VP22-mediated delivery of 
active Cre protein works with sufficient efficacy to facilitate inducible gene 
targeting. In this experiment bacterial expression vectors were
constructed for the production of VP22-Cre fusion proteins in E. coli. The
activity of purified VP22-Cre proteins was tested using a reporter
fibroblast cell line containing a loxP-flanked reporter construct.

Thus, target gene alteration can be induced by injecting the purified
fusion protein of the present invention into a living organism (e.g., a
mouse) carrying a gene comprising the RRS-flanked target sequence (e.g.,
in an amount of 1 to 200, preferably 5 to 50 µg per g body weight). To
demonstrate the feasibility of the invention, a reporter mouse strain
carrying an RRS-flanked cassette was used (Thorey et al., Mol. Cell Biol.,
18(10):6164 (1998)).
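
The amount ranges stated above (1 to 200, preferably 5 to 50 µg per g body weight) translate into absolute amounts per animal by simple multiplication. The sketch below illustrates this; the body weight of 25 g is an assumed example value for an adult mouse and the function name is hypothetical.

```python
def dose_range_ug(body_weight_g, low_ug_per_g, high_ug_per_g):
    """Return the (minimum, maximum) amount of fusion protein in µg."""
    if body_weight_g <= 0:
        raise ValueError("body weight must be positive")
    return (body_weight_g * low_ug_per_g, body_weight_g * high_ug_per_g)


BROAD_RANGE = (1.0, 200.0)     # µg per g body weight, as stated above
PREFERRED_RANGE = (5.0, 50.0)  # µg per g body weight, as stated above

mouse_weight_g = 25.0  # assumed example value, not taken from the text

broad = dose_range_ug(mouse_weight_g, *BROAD_RANGE)
preferred = dose_range_ug(mouse_weight_g, *PREFERRED_RANGE)
print(broad)      # (25.0, 5000.0)
print(preferred)  # (125.0, 1250.0)
```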

Analysis is achieved by determining the pattern of induced target gene 
recombination (e.g. through PCR analysis, Southern blot analysis or X-Gal 
staining on tissue sections; Maniatis et al., 1989; Gossler and Zachgo, 
Joyner AL (Ed.), Oxford University Press, Oxford, New York (1993)). 

The procedure's advantages over current technology are as follows:

(i) The absence of background recombination before administration of
the fusion protein.

(ii) The reduction of time and resources which are necessary to combine 
the recombinase transgene and two copies of the RRS-flanked target 
gene by conventional breeding. 

The experiments showed the following:

(a) With a suitable vector for the expression of a TAT-Cre fusion protein,
a TAT-Cre fusion protein was expressed in E. coli and purified from
bacterial lysates.

(b) A reporter cell line containing a loxP-containing reporter construct was
used to test the activity of the TAT-Cre protein in vitro. This reporter,
when recombined by Cre recombinase, allows the expression of a
β-galactosidase gene.

(c) A transgenic mouse strain carrying a loxP-flanked target was used to
investigate the activity of the TAT-Cre protein in vivo.

These experiments demonstrate that TAT-mediated delivery of active Cre 
protein works with sufficient efficacy to facilitate inducible gene targeting 
both in cell lines and living organisms.

Furthermore, bacterial expression vectors were constructed for the
production of VP22-Cre fusion proteins in E. coli. The activity of purified
VP22-Cre proteins was tested using a reporter fibroblast cell line
containing a loxP-flanked reporter construct. These experiments
demonstrate that VP22-mediated delivery of active Cre protein works with 
sufficient efficacy to facilitate inducible gene targeting. 

The invention is further illustrated by the following non-limiting
examples. 

Examples 

Materials and Methods 

Construction of pT7-TACS: The TAT-Cre coding region was generated by
PCR using Advantage-HF PCR Kit (Clontech), 20 pmol of the primers
TATcre sense (5'-atg cca tgg gct acg gcc gca aga agc gcc gcc aac gcc gcc
gcg gca tgt cca att tac tga ccg tac acc-3'; SEQ ID NO:31) and TATcre
antisense (5'-ttt cgg atc cgc cgc ata acc agt g-3'; SEQ ID NO: 32) and 10
ng pCMV-I-Cre-pA (see SEQ ID NO:29) as template. The PCR reaction was
performed using the following cycle profile: 2' 94 °C, 4 x (30" 94 °C,
30" 50 °C, 1' 72 °C), 12 x (30" 94 °C, 30" 55 °C, 1' 72 °C) and 10'
72 °C. The resulting PCR fragment was digested with Nco I and BamH I,
treated with Klenow enzyme and ligated into the plasmid pBSII KS+ which
had been opened with restriction enzyme BamH I, treated with Klenow
and dephosphorylated with calf intestinal phosphatase. The resulting
plasmid pBS TAT-5'cre was verified by DNA sequencing. The plasmid
pCMV-I-Cre-pA (SEQ ID NO:29) was digested with Age I and Sal I which
released a 1.036 kb fragment containing the 3' part of the Cre coding
region. This fragment was ligated into the plasmid pBS TAT-5'cre which
had been opened with Age I and Sal I.

10 ng pBS-TATCre was subjected to PCR amplification using 20 pmol of
primers FPA001 (5'-tat atc tag acc atg ggc tac ggc cgc aag aag c-3'; SEQ
ID NO: 33) and FPA002 (5'-gct acc acg acc ttc gat acc atc gcc atc ttc cag
cag gcg c-3'; SEQ ID NO: 34). PCR was performed using 2.5 U Platinum
Pfx DNA polymerase (Gibco BRL) and 2 x Enhancer Solution (Gibco BRL)
according to the manufacturer's protocol. The following cycle profile was
used: 2' 94 °C, 25 x (30" 94 °C, 15" 54.6 °C, 2'30" 68 °C). The
amplified PCR fragment was purified using GFX columns (Amersham
Pharmacia), digested with Xba I and ligated into the plasmid pASK75
(Skerra and Arne, Gene 151: 131-135 (1994)) which had been opened
with restriction enzymes Xba I and Eco 47 III and dephosphorylated with
calf intestinal phosphatase. The resulting plasmid pASK75-TACS was
digested with restriction enzymes Nco I and Hind III which released a 1.1
kb fragment. The fragment was subsequently ligated into the plasmid pT7-
7 (Studier and Moffatt, J. Mol. Biol. 189: 113-130 (1986)) which had been
opened with restriction enzymes Nco I and Hind III and dephosphorylated 
with calf intestinal phosphatase resulting in the plasmid pT7-TACS (SEQ ID 
NO:16). 
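
The two cycle profiles used in the construction of pT7-TACS can be converted into a total programmed hold time, which is useful when planning the runs. The sketch below sums the hold times only; ramp times between temperatures depend on the thermocycler and are deliberately ignored, and the data layout and function name are illustrative assumptions.

```python
def total_hold_seconds(profile):
    """Sum the programmed hold times of a PCR profile.

    profile is a list of (repeats, [step durations in seconds]) tuples.
    """
    return sum(repeats * sum(steps) for repeats, steps in profile)


# 2' 94 °C, 4 x (30" 94 °C, 30" 50 °C, 1' 72 °C),
# 12 x (30" 94 °C, 30" 55 °C, 1' 72 °C) and 10' 72 °C
tat_cre_pcr = [
    (1, [120]),
    (4, [30, 30, 60]),
    (12, [30, 30, 60]),
    (1, [600]),
]

# 2' 94 °C, 25 x (30" 94 °C, 15" 54.6 °C, 2'30" 68 °C)
strep_tag_pcr = [
    (1, [120]),
    (25, [30, 15, 150]),
]

print(total_hold_seconds(tat_cre_pcr) / 60)    # 44.0 minutes
print(total_hold_seconds(strep_tag_pcr) / 60)  # 83.25 minutes
```

The actual run time is longer than these values because of the heating and cooling ramps.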

Construction of pT7-VPCS: The Cre coding region was generated by PCR
using Advantage-HF PCR Kit (Clontech), 20 pmol of the primers VP22cre
sense (5'-taa cta gcg gcc gca tgt cca att tac tga ccg tac ac-3'; SEQ ID
NO:35) and VP22cre antisense (5'-tcg agc ggc cgc cat cgc cat ctt cca gca
ggc g-3'; SEQ ID NO:36) and 10 ng pgkcre-pA (SEQ ID NO:40) as
template. The PCR reaction was performed using the following cycle
profile: 2' 94 °C, 5 x (30" 94 °C, 30" 50 °C, 2' 72 °C), 15 x (30" 94 °C,
30" 55 °C, 2' 72 °C) and 10' 72 °C. The resulting PCR fragment was
digested with Not I and ligated into the plasmid pVP22/Myc-His
(Invitrogen), which had been opened with restriction enzyme Not I and
dephosphorylated with calf intestinal phosphatase. The resulting plasmid
pVP22-cre myc/His was verified by DNA sequencing.

10 ng pVP22-cre myc/His was subjected to PCR amplification using 20
pmol of primers FPA004 (5'-tat atc tag aca tat gac ctc tcg ccg ctc cg-3';
SEQ ID NO:37) and FPA002 (SEQ ID NO:34). PCR was performed using
2.5 U Platinum Pfx DNA polymerase (Gibco BRL) and 2 x Enhancer
Solution (Gibco BRL) according to the manufacturer's protocol. The
following cycle profile was used: 2' 94 °C, 25 x (30" 94 °C, 15" 54.6
°C, 2'30" 68 °C). The amplified PCR fragment was purified using GFX
columns (Amersham Pharmacia), digested with Xba I and ligated into the
plasmid pASK75 (Skerra and Arne, Gene 151: 131-135 (1994)) which had
been opened with restriction enzymes Xba I and Eco 47 III and
dephosphorylated with calf intestinal phosphatase. The resulting plasmid
pASK75-VPCS was digested with restriction enzymes Nde I and Hind III
which released a 2.0 kb fragment. The fragment was subsequently ligated
into the plasmid pT7-7 (Studier and Moffatt, J. Mol. Biol. 189: 113-130 
(1986)) which had been opened with restriction enzymes Nde I and Hind 
III and dephosphorylated with calf intestinal phosphatase resulting in the 
plasmid pT7-VPCS (SEQ ID NO: 17). 

Construction of pCRT7-AVPCS: The AVP22-Cre coding region was 
generated by PCR using Platinum Pfx DNA polymerase (Life Technologies), 
20 pmol of the primers FPA007 (5'-ttc cga aga cga cga aac acc-3'; SEQ ID 
NO:38) and FPA008 (5'-tat att cga agc tta tta acc acc gaa ctg cg-3'; SEQ 
ID NO:39) and 30 ng pT7-VPCS (SEQ ID NO:17) as template. The PCR 
reaction was performed using the following cycle profile: 2' 94 °C, 25 x 
(30" 94 °C, 30" 61 °C, 2'30" 68 °C) and 7' 68 °C. The resulting 1,8 kb 
PCR fragment was digested with Nco I and Sfu I and ligated into the 
plasmid pCRT7/VP22-1 (Invitrogen), which had been opened with 
restriction enzymes Nco I and Sfu I and dephosphorylated with calf 
intestinal phosphatase. The resulting plasmid pCRT7-AVPCS (SEQ ID 
NO:15) was verified by DNA sequencing. 

Expression of the fusion proteins in E. coli: E. coli BL21(DE3)-RIL cells 
(Stratagene) were transformed with pT7-TACS and grown on LB agar 
plates containing 100 ug/ml ampicillin. E. coli BL21(DE3)-RP cells 
(Stratagene) were transformed with pT7-VPCS and grown on LB agar 
plates containing 100 ug/ml ampicillin. E. coli BL21(DE3)-pLysS 
(Invitrogen) were transformed with pCRT7-AVPCS and grown on LB agar 
plates containing 25 ug/ml kanamycin and 34 ug/ml chloramphenicol. 
Single colonies were isolated and used to prepare glycerol stocks. Eight 
5ml LB (Luria-Bertani) aliquots containing antibiotics were inoculated with 
stabs from the glycerol stocks and grown overnight at 37°C with shaking. 
Two 5ml overnight cultures were each used to inoculate one of four 1L LB 
aliquots containing antibiotics and grown at 37°C with shaking. Growth 
rate was monitored by spectrophotometry at 578nm. When the cultures 
had reached an OD578 of 0,5, expression of the fusion proteins was 
induced by the addition of 0,5 mM isopropyl-β-D-1-thiogalactopyranoside 
(IPTG). Two hours after induction cells were harvested by centrifugation at 
12000xg and the pellet was rapidly frozen in liquid nitrogen and stored 
immediately at -80°C. 
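
For orientation, the amount of IPTG needed to reach the stated final concentration in one of the 1L cultures can be estimated as follows. This is a minimal sketch: the function name is hypothetical, and the molar mass of 238.30 g/mol for IPTG (C9H18O5S) is a value supplied here, not taken from the text.

```python
IPTG_G_PER_MOL = 238.30  # molar mass of IPTG, C9H18O5S (assumed value)


def iptg_mg(final_mM, culture_litres):
    """Mass of IPTG in mg for a given final concentration and volume.

    mmol/L x L gives mmol; mmol x g/mol gives mg.
    """
    return final_mM * culture_litres * IPTG_G_PER_MOL


mg_per_litre_culture = iptg_mg(final_mM=0.5, culture_litres=1.0)
print(f"{mg_per_litre_culture:.1f} mg IPTG per 1 L culture")
```

With these inputs roughly 119 mg IPTG per litre of culture would be required.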

Purification of the fusion proteins from bacterial lysates: Each 10g cell 
pellet was resuspended on ice in 30ml Bicine buffer (50mM Bicine, pH 8,5) 
including one protease inhibitor tablet (Complete, Roche). Cells were lysed 
through threefold treatment (1500psi, 5 minutes) with the cell disruption 
bomb (Parr Instrument). 30ml of Benzonase (10000U, Merck) was added 
and cell extracts were incubated for 30 minutes at 4°C. Cell extracts were 
then centrifuged at 12,000xg (4°C). The pellet was redissolved in 8M 
urea, 50mM Bicine, 100mM DTT, pH 8,5 by incubation for 16 hours at 
4°C. Protein extract was centrifuged at 31000xg and supernatant 
harvested. Protein extract was diluted in an equal volume of 
Chromatography buffer A (50mM Bicine, pH 8,5). The pH was adjusted to pH 
8,5 and the extract was filtered through a 0,45 µm filter (Millipore). FPLC 
(Akta Explorer, Amersham Pharmacia) was performed using a cation 
exchange column (Sepharose SP, Column body HR 5/5 (0.5 x 5cm), 
column volume (CV) 1ml, linear flow 300cm/hour, Amersham Pharmacia). 
After addition of sample to FPLC column, buffer was exchanged with 
Chromatography buffer A at 10 CV. 

TAT-Cre and VP22-Cre fusion proteins were eluted from the column by 
gradient elution using chromatography buffer B (50mM Bicine, 1M NaCl, 
pH 8,5) using the following profile: 0 - 50 % buffer B, 0 CV; 50 % buffer 
B, 10 CV; 50 - 100 % buffer B (linear gradient), 20 CV; 100 % buffer B, 
10 CV. AVP22-Cre protein was eluted from the column by gradient elution 
using the following profile: 0 - 10 % buffer B, 0 CV; 10 % buffer B, 10 CV; 
10 - 30 % buffer B, 0 CV; 30 % buffer B, 10 CV; 30 - 100 % buffer B, 0 
CV; 100 % buffer B, 10 CV. Three 1,5ml fractions each containing purified 
fusion proteins were collected. Purity and concentration of protein 
fractions were determined by Coomassie blue stained SDS-PAGE gels and 
Western blot analysis using dilutions of BSA standard solutions. In addition 
protein content was determined using a Bradford assay (Coomassie Plus 
protein assay, Pierce). 
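
The two elution profiles described above can be written down as lists of segments, which makes it easy to look up the buffer B fraction, and hence the NaCl concentration, at any point of the run. The sketch below is an illustration only: the data layout and function names are hypothetical, a segment length of 0 CV is interpreted as an immediate step, and buffer A is taken to contain no NaCl, as its stated composition suggests.

```python
# Each segment: (start % B, end % B, length in column volumes).
TAT_VP22_PROFILE = [(0, 50, 0), (50, 50, 10), (50, 100, 20), (100, 100, 10)]
AVP22_PROFILE = [(0, 10, 0), (10, 10, 10), (10, 30, 0), (30, 30, 10),
                 (30, 100, 0), (100, 100, 10)]
BUFFER_B_NACL_M = 1.0  # buffer B: 50mM Bicine, 1M NaCl, pH 8,5


def percent_b(profile, cv):
    """Return the % buffer B reached after `cv` column volumes."""
    elapsed = 0.0
    level = 0.0
    for start, end, length in profile:
        if length == 0:          # instantaneous step to a new level
            level = end
            continue
        if cv < elapsed + length:
            return start + (end - start) * (cv - elapsed) / length
        elapsed += length
        level = end
    return level                 # past the end of the programme


def nacl_molar(profile, cv):
    """NaCl concentration in mol/l, assuming buffer A contains no NaCl."""
    return percent_b(profile, cv) / 100 * BUFFER_B_NACL_M


for cv in (0, 20, 35):
    print(cv, percent_b(TAT_VP22_PROFILE, cv), nacl_molar(TAT_VP22_PROFILE, cv))
```

Read this way, the TAT-Cre and VP22-Cre programme spans 40 CV and contains one linear gradient, whereas the AVP22-Cre programme spans 30 CV and consists of steps only.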

SDS-PAGE and Western blot analysis: SDS-PAGE and Coomassie staining 
were performed according to standard protocols (Maniatis et al., Cold 
Spring Harbor Laboratory, New York (1989)) using 4-12 % gradient 
SDS-polyacrylamide gels (NuPAGE, Invitrogen, cat. no.: NP0321). 
Western blot analysis was performed using a Semi-Dry Blotting Chamber 
(Biorad) and nitrocellulose membranes (0,2 µm; Schleicher & Schuell) 
according to the manufacturer's protocols. The fusion proteins were 
detected by using an alkaline phosphatase-conjugated anti-strep tag 
antibody (IBA, Cat. No.: 2-1503-001) according to the manufacturer's 
protocol. 

Generation of the M5Pax8 Cre reporter cell line: The SV40-transformed 
murine embryonic fibroblast line MEF5/5 (Schwenk et al., Nucl. Acids Res. 
26(6), 1427-32 (1998)) was transfected with the vector pPGKpaX1 
(Kellendonk et al., Nucl. Acids Res. 24, 1404-11 (1996)). 10^6 MEF5/5 cells 
were electroporated with 20 ug pPGKpaX1 plasmid DNA linearised with 
Sca I and plated into 48-well-plates. The cells were cultured in 
DMEM/Glutamax medium (Life Technologies) supplemented with 10 % 
fetal calf serum at 37°C, 10 % CO2 in humid atmosphere. Two days after 
transfection the medium was supplemented with 5 ug/ml puromycin 
(Calbiochem) for the selection of stable integrants. 14 puromycin-resistant 
clones were expanded and tested by transient transfection with 
the Cre expression vector pPGK-Cre-pA (SEQ ID NO: 40). In two out of 
the 14 puromycin-resistant clones, the expression of β-galactosidase 
could be detected by staining with X-Gal. One of these clones, M5Pax8, 
was used as Cre reporter cell line. 

Transfection and measurement of β-galactosidase activity: Fibroblasts 
(10^6 cells per 24 well plate (Falcon)) were transfected with 25 ng 
pCMV-I-Cre-pA (see SEQ ID NO: 29) or pCMV-I-β-pA (see SEQ ID NO:30) plasmids 
using the FuGene transfection reagent (Roche Diagnostics). After 2 days 
the cells were lysed and the β-galactosidase activities were determined 
with the β-galactosidase reporter gene assay (Roche Diagnostics) 
according to the manufacturer's guidelines using a Lumistar luminometer 
(MWG). 

Histochemical detection of β-galactosidase activity: To quantitate 
β-galactosidase expression, fibroblast cells were washed once with 
phosphate buffered saline (PBS), and the cells were fixed for 5 minutes at 
room temperature in a solution of 4% formaldehyde in PBS. Next, the cells 
were washed twice with PBS and finally incubated in staining solution for 
24 hours at 37°C (staining solution: 5 mM K3(Fe(CN)6), 5 mM 
K4(Fe(CN)6), 2 mM MgCl2, 1 mg/ml X-Gal (BioMol) in PBS). Blue stained, 
β-galactosidase positive cells were detected and distinguished from 
negative (transparent) cells in a cell culture binocular microscope under 
200x magnification. For each determination a minimum of 200 cells was 
counted. 
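
To give a feeling for the precision that a count of 200 cells provides, the fraction of blue cells can be reported together with a confidence interval. The sketch below uses the Wilson score interval, which is a choice made here and not a method named in the text; the cell counts are hypothetical illustration values.

```python
import math


def wilson_interval(positive, total, z=1.96):
    """Wilson score interval (default ~95 %) for a counted proportion."""
    p = positive / total
    denom = 1 + z ** 2 / total
    centre = (p + z ** 2 / (2 * total)) / denom
    half = z * math.sqrt(p * (1 - p) / total
                         + z ** 2 / (4 * total ** 2)) / denom
    return centre - half, centre + half


# Hypothetical count: 100 blue cells among 200 counted cells.
low, high = wilson_interval(positive=100, total=200)
print(f"50.0 % positive, interval {low * 100:.1f} % to {high * 100:.1f} %")
```

For a proportion near 50 % and 200 counted cells, the interval is roughly plus or minus 7 percentage points; counting more cells narrows it.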

PCR detection of Cre-mediated recombination: Genomic DNA extracted 
from tissue samples was subjected to PCR using Taq polymerase (Gibco 
BRL Cat. No. 10342-020) and 20 pmol of each primer (sense: 5' -CAT 
CTC CGG GCC TTT CGA CCT G - 3', antisense: 5' -GCG ATC GGT GCG 
GGC CTC TTC - 3'; SEQ ID NOs: 41 and 42, respectively). PCR was 
performed using the following cycle profile: 2' 94°C, 35 x (30" 94°C, 30" 
55 °C, 1' 72 °C), 10 min 72 °C. PCR products were separated on a 1,2 % 
agarose gel. 

Example 1 

The vector pT7-TACS (SEQ ID NO: 16) was constructed for the expression 
of a TAT-Cre fusion protein in E. coli. The plasmid contains the coding 
region of the 11 amino acid protein transduction domain of the wild-type 
HIV TAT protein (Green and Loewenstein, Cell, 55(6): 1179-88 (1988); 
Frankel and Pabo, Cell, 55(6): 1189-93 (1988); SEQ ID NO:10) fused to 
the N-terminus of Cre recombinase protein sequence. The 10-amino-acid 
strep tag at the C-terminus allows the detection and purification of the 
fusion protein using specific antibodies (Schmidt and Skerra, J. 
Chromatogr A 676: 337-345 (1994)). The protease factor Xa recognition 
site (Ile-Glu-Gly-Arg) permits the removal of the strep tag by proteolytic 
cleavage. The estimated molecular weight of the TAT-Cre fusion protein is 
42 kDa. A scheme of the TAT-Cre expression vector is depicted in figure 2. 
For the expression of TAT-Cre, the E. coli strain BL21(DE3)-RIL 
(Stratagene) was used. This strain carries an IPTG-inducible T7 
polymerase gene and additional copies of the tRNA genes for the 'rare 
codons' argU, ileY and leuW. 

E. coli BL21(DE3)-RIL cells were transformed with pT7-TACS and grown in 
LB medium containing 100 ug/ml ampicillin. The expression of the 40 kDa 
TAT-Cre fusion protein could be strongly induced by the addition of 0,5 
mM IPTG to the culture medium. Analysis of protein lysates revealed that 
approximately 50 % of TAT-Cre protein accumulated as insoluble inclusion 
bodies. The inclusion bodies were extracted and dissolved in 8 M urea. 
TAT-Cre was subsequently purified from this fraction using ion exchange 
chromatography. The quantity and purity of TAT-Cre protein were 
determined using Coomassie stained SDS-PAGE gels and Western blot 
analysis (figure 3). The purification process yielded TAT-Cre protein 
extracts of 64 % purity and a concentration of 100 ug/ml. 
To analyse the ability of the purified TAT-Cre protein to transduce into 
cultured cells, we used the fibroblast cell line M5Pax8 (R. Kuhn, 
unpublished) that contains a loxP-containing reporter construct. This 
reporter, when recombined by Cre recombinase, allows the expression of 
a β-galactosidase gene (Buchholz et al., Nucleic Acids Res. 24, 4256-4262, 
1996). Cells were cultured for 18 h with increasing concentrations of 
TAT-Cre protein in serum-free medium and analysed 4 days later for 
β-galactosidase activity. Staining with X-Gal showed that > 50 % of the cells 
treated with 13,8 ug/ml TAT-Cre protein expressed β-galactosidase, 
indicating that recombination of the loxP-flanked reporter construct had 
occurred (figure 4). Measurement of β-galactosidase activity in cell 
lysates revealed an up to 30-fold higher level of β-galactosidase activity 
in comparison to cells which had been transiently transfected with a 
eukaryotic Cre expression vector (figure 5). 

To investigate the activity of TAT-Cre protein in a living organism, we used 
a transgenic mouse strain carrying a loxP-flanked target for Cre-mediated 
recombination (Thorey et al., 1998, Mol. Cell. Biol. 18: 3081 - 3088). Mice 
were treated three times with intraperitoneal injections of 75 ug TAT-Cre 
protein at two-day intervals and analysed 2 days later. Genomic DNA was 
isolated from a variety of organs and subjected to PCR amplification which 
specifically amplifies a 400 bp fragment of the recombined allele. The 
deleted allele could be detected in multiple tissues from treated mice, 
indicating TAT-Cre-mediated recombination in these organs (figure 6). 
This experiment demonstrates that TAT-mediated delivery of active Cre 
protein works with sufficient efficacy to facilitate inducible gene targeting 
in cell lines and in living organisms. 
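
The protein amounts used in this example are given as mass concentrations. For comparison with other work they can be converted to molar units using the estimated molecular weight of 42 kDa stated above. The following sketch makes the simplifying assumption that the stated mass is entirely TAT-Cre fusion protein, although the preparation is reported as 64 % pure; the function names are hypothetical.

```python
TAT_CRE_KDA = 42.0  # estimated molecular weight stated in the text


def ug_per_ml_to_nM(ug_per_ml, kda):
    """Convert a mass concentration in ug/ml to nmol/l.

    ug/ml equals mg/l; mg/l divided by kg/mol (= kDa) gives umol/l,
    multiplied by 1000 gives nmol/l.
    """
    return ug_per_ml / kda * 1000.0


def ug_to_nmol(ug, kda):
    """Convert a mass in ug to an amount in nmol."""
    return ug / kda


culture_nM = ug_per_ml_to_nM(13.8, TAT_CRE_KDA)   # cell culture dose
injection_nmol = ug_to_nmol(75.0, TAT_CRE_KDA)    # single i.p. injection
print(f"{culture_nM:.0f} nM in culture, {injection_nmol:.2f} nmol per injection")
```

Under this assumption 13,8 ug/ml corresponds to about 0,33 uM, and one injection of 75 ug to about 1,8 nmol of fusion protein.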

Example 2 

The vectors pT7-VPCS (SEQ ID NO: 17) and pCRT7-AVPCS (SEQ ID NO: 15) 
were constructed for the expression of VP22-Cre and AVP22-Cre fusion 
proteins in E. coli. The VP22-Cre gene of pT7-VPCS contains the full length 
protein translocation domain of the HSV VP22 protein (Elliott and O'Hare, 
Cell, 88(2): 223-33 (1997)), whereas the AVP22-Cre gene of pCRT7-AVPCS 
contains a truncated VP22 protein transduction domain (amino acids 159 - 
301; Invitrogen; aa 16-157 of SEQ ID NO:14) fused to the N-terminus of 
Cre recombinase protein sequence. A 10-amino-acid strep tag at the C- 
terminus of Cre protein sequence allows the detection and purification of 
the fusion proteins using specific antibodies (Schmidt and Skerra, J. 
Chromatogr A 676: 337-345 (1994)). The protease factor Xa recognition 
site permits the removal of the Strep tag by proteolytic cleavage. The 
estimated molecular weight is 75 kDa for VP22-Cre protein and 60 kDa for 
AVP22-Cre protein. A scheme of the vectors pT7-VPCS and pCRT7-AVPCS 
is depicted in figure 7. 

E. coli BL21(DE3)-RP cells (Stratagene) were transformed with pT7-VPCS 
and cultured in LB medium containing 100 ug/ml ampicillin. E. coli 
BL21(DE3)-pLysS cells (Stratagene) were transformed with pCRT7-AVPCS 
and cultured in LB medium containing 25 ug/ml kanamycin and 34 ug/ml 
chloramphenicol. Expression of the VP22-Cre and AVP22-Cre fusion 
proteins could be induced by the addition of 0,5 mM IPTG to the culture 
medium. Analysis of protein extracts using Coomassie staining and 
Western blotting of SDS-PAGE gels revealed that 50 - 60 % of VP22-Cre 
and AVP22-Cre proteins accumulated as insoluble inclusion bodies. The 
inclusion bodies were extracted and dissolved in 8 M urea. VP22-Cre and 
AVP22-Cre fusion proteins were subsequently purified using ion exchange 
chromatography. The quantity and purity of the isolated VP22-Cre and 
AVP22-Cre fusion proteins were determined using Coomassie stained 
SDS-PAGE gels and Western blot analysis (figure 8). 
To analyse the ability of the purified fusion proteins to transduce into 
cultured cells, we used the fibroblast cell line M5Pax8 that contains a 
loxP-containing reporter construct. When recombined by Cre recombinase, the 
reporter allows the expression of a β-galactosidase gene (Buchholz et al., 
Nucleic Acids Res. 24, 4256-4262, 1996). The cells were cultured for 18 
h with increasing concentrations of VP22-Cre and AVP22-Cre in serum-free 
medium and analysed 4 days later for β-galactosidase activity. Staining 
with X-Gal showed ~2 % blue cells in the cultures treated with up to 15 
ug/ml AVP22-Cre, indicating that recombination of the loxP-flanked reporter 
construct had occurred. In contrast, cell cultures treated with up to 0,5 
ug/ml VP22-Cre did not show any X-gal staining (figure 9). Measurement 
of cell lysates revealed a strong increase of β-galactosidase activity upon 
AVP22-Cre treatment when compared to untreated cells (figure 10). 
Genomic DNA was isolated and subjected to PCR amplification that 
specifically amplifies a 250 bp fragment of the recombined allele. The 
deleted allele could be detected in cells treated with both VP22-Cre and 
AVP22-Cre fusion proteins (figure 11). 

This experiment demonstrates that VP22-mediated delivery of active Cre 
protein works with sufficient efficacy to facilitate inducible gene targeting. 

SEQUENCE LISTING 
<110> ARTEMIS Pharmaceuticals GmbH 

<120> Transduction of recombinases for inducible gene 
targeting 

<130> 010007wo/JH/ml 

<140> 
<141> 

<160> 42 

<170> PatentIn Ver. 2.1 



<210> 1 
<211> 1074 
<212> DNA 
<213> Artificial Sequence 

<220> 
<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Cre 

<220> 
<221> CDS 
<222> (1)..(1071) 

<400> 1 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg tcc 48 
Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser 
1 5 10 15 

aat tta ctg acc gta cac caa aat ttg cct gca tta ccg gtc gat gca 96 
Asn Leu Leu Thr Val His Gln Asn Leu Pro Ala Leu Pro Val Asp Ala 
20 25 30 

acg agt gat gag gtt cgc aag aac ctg atg gac atg ttc agg gat cgc 144 
Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg 
35 40 45 

cag gcg ttt tct gag cat acc tgg aaa atg ctt ctg tcc gtt tgc cgg 192 
Gln Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg 
50 55 60 

tcg tgg gcg gca tgg tgc aag ttg aat aac cgg aaa tgg ttt ccc gca 240 
Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala 
65 70 75 80 

gaa cct gaa gat gtt cgc gat tat ctt cta tat ctt cag gcg cgc ggt 288 
Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala Arg Gly 
85 90 95 

ctg gca gta aaa act atc cag caa cat ttg ggc cag cta aac atg ctt 336 
Leu Ala Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn Met Leu 
100 105 110 

cat cgt cgg tcc ggg ctg cca cga cca agt gac agc aat gct gtt tca 384 
His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser 
115 120 125 

ctg gtt atg cgg cgg atc cga aaa gaa aac gtt gat gcc ggt gaa cgt 432 
Leu Val Met Arg Arg Ile Arg Lys Glu Asn Val Asp Ala Gly Glu Arg 
130 135 140 

gca aaa cag gct cta gcg ttc gaa cgc act gat ttc gac cag gtt cgt 480 
Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gln Val Arg 
145 150 155 160 

tca ctc atg gaa aat agc gat cgc tgc cag gat ata cgt aat ctg gca 528 
Ser Leu Met Glu Asn Ser Asp Arg Cys Gln Asp Ile Arg Asn Leu Ala 
165 170 175 

ttt ctg ggg att gct tat aac acc ctg tta cgt ata gcc gaa att gcc 576 
Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu Arg Ile Ala Glu Ile Ala 
180 185 190 

agg atc agg gtt aaa gat atc tca cgt act gac ggt ggg aga atg tta 624 
Arg Ile Arg Val Lys Asp Ile Ser Arg Thr Asp Gly Gly Arg Met Leu 
195 200 205 

atc cat att ggc aga acg aaa acg ctg gtt agc acc gca ggt gta gag 672 
Ile His Ile Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu 
210 215 220 

aag gca ctt agc ctg ggg gta act aaa ctg gtc gag cga tgg att tcc 720 
Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp Ile Ser 
225 230 235 240 

gtc tct ggt gta gct gat gat ccg aat aac tac ctg ttt tgc cgg gtc 768 
Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val 
245 250 255 

aga aaa aat ggt gtt gcc gcg cca tct gcc acc agc cag cta tca act 816 
Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gln Leu Ser Thr 
260 265 270 

cgc gcc ctg gaa ggg att ttt gaa gca act cat cga ttg att tac ggc 864 
Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr His Arg Leu Ile Tyr Gly 
275 280 285 

gct aag gat gac tct ggt cag aga tac ctg gcc tgg tct gga cac agt 912 
Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu Ala Trp Ser Gly His Ser 
290 295 300 

gcc cgt gtc gga gcc gcg cga gat atg gcc cgc gct gga gtt tca ata 960 
Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser Ile 
305 310 315 320 

ccg gag atc atg caa gct ggt ggc tgg acc aat gta aat att gtc atg 1008 
Pro Glu Ile Met Gln Ala Gly Gly Trp Thr Asn Val Asn Ile Val Met 
325 330 335 

aac tat atc cgt aac ctg gat agt gaa aca ggg gca atg gtg cgc ctg 1056 
Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu 
340 345 350 

ctg gaa gat ggc gat tag 1074 
Leu Glu Asp Gly Asp 
355 
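
The consistency of the listing for SEQ ID NO:1 can be spot-checked by translating the first and last codon rows with the standard genetic code. The sketch below is an illustration under stated assumptions: the codons are transcribed from the listing with each triplet read as a, c, g or t only (for example "tcc" for the final Ser codon of the first row), and the helper names are hypothetical.

```python
from itertools import product

# Standard genetic code, codons ordered t, c, a, g at each position.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string (spaces allowed) codon by codon."""
    dna = dna.replace(" ", "").lower()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


FIRST_ROW = "atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg tcc"
LAST_ROW = "ctg gaa gat ggc gat tag"

n_terminus = translate(FIRST_ROW)
c_terminus = translate(LAST_ROW)
tat_domain = n_terminus[2:13]        # residues 3 to 13 of the fusion
coded_residues = 1071 // 3           # CDS (1)..(1071)

print(n_terminus, tat_domain, c_terminus, coded_residues)
```

The translation gives MGYGRKKRRQRRRGMS for the first row, so that the 11 amino acid TAT transduction domain YGRKKRRQRRR is followed by a glycine and the Met-Ser start of Cre. The last row ends in a stop codon, and the CDS length of 1071 nucleotides corresponds to the 357 residues given for SEQ ID NO:2.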



<210> 2 
<211> 357 
<212> PRT 
<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Cre 

<400> 2 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser 
1 5 10 15 

Asn Leu Leu Thr Val His Gln Asn Leu Pro Ala Leu Pro Val Asp Ala 
20 25 30 

Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg 
35 40 45 

Gln Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg 
50 55 60 

Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala 
65 70 75 80 

Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala Arg Gly 
85 90 95 

Leu Ala Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn Met Leu 
100 105 110 

His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser 
115 120 125 

Leu Val Met Arg Arg Ile Arg Lys Glu Asn Val Asp Ala Gly Glu Arg 
130 135 140 

Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gln Val Arg 
145 150 155 160 

Ser Leu Met Glu Asn Ser Asp Arg Cys Gln Asp Ile Arg Asn Leu Ala 
165 170 175 

Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu Arg Ile Ala Glu Ile Ala 
180 185 190 

Arg Ile Arg Val Lys Asp Ile Ser Arg Thr Asp Gly Gly Arg Met Leu 
195 200 205 

Ile His Ile Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu 
210 215 220 

Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp Ile Ser 
225 230 235 240 

Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val 
245 250 255 

Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gln Leu Ser Thr 
260 265 270 

Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr His Arg Leu Ile Tyr Gly 
275 280 285 

Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu Ala Trp Ser Gly His Ser 
290 295 300 

Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser Ile 
305 310 315 320 

Pro Glu Ile Met Gln Ala Gly Gly Trp Thr Asn Val Asn Ile Val Met 
325 330 335 

Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu 
340 345 350 

Leu Glu Asp Gly Asp 
355 



<210> 3 
<211> 1317 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Flpe 

<220> 

<221> CDS 

<222> (1)..(1311) 

<400> 3 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg agt 48 
Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser 
1 5 10 15 

caa ttt gat ata tta tgt aaa aca cca cct aag gtc ctg gtt cgt cag 96 
Gin Phe Asp He Leu Cys Lys Thr Pro Pro Lys Val Leu Val Arg Gin 
20 25 30 

ttt gtg gaa agg ttt gaa aga cct tca ggg gaa aaa ata gca tca tgt 144 
Phe Val Glu Arg Phe Glu Arg Pro Ser Gly Glu Lys Ile Ala Ser Cys 
35 40 45 

get get gaa eta acc tat tta tgt tgg atg att act cat aac gga aca 192 
Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met He Thr His Asn Gly Thr 
50 55 60 

gca ate aag aga gee aca ttc atg age tat aat act ate ata age aat 240 
Ala He Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr He He Ser Asn 
65 70 75 80 

teg ctg agt ttc gat att gtc aac aaa tea etc cag ttt aaa tac aag 288 
Ser Leu Ser Phe Asp He Val Asn Lys Ser Leu Gin Phe Lys Tyr Lys 
85 - 90 95 

acg caa aaa gca aca att ctg gaa gee tea tta aag aaa tta att cct 336 
Thr Gin Lys Ala Thr He Leu Glu Ala Ser Leu Lys Lys Leu He Pro 
100 105 110 

get tgg gaa ttt aca att att cct tac aat gga caa aaa cat caa tct 384 
Ala Trp Glu Phe Thr He He Pro Tyr Asn Gly Gin Lys His Gin Ser 
115 120 125 

gat ate act gat att gta agt agt ttg caa tta eag ttc gaa tea teg 432 
Asp He. Thr Asp He Val Ser Ser Leu Gin Leu Gin Phe Glu Ser Ser 
130 135 140 

gaa gaa. gca gat aag gga aat age cac agt aaa aaa atg ctt aaa gca 480 
Glu Glu Ala Asp Lys Gly Asn Ser His Ser Lys Lys Met Leu Lys Ala 
145 150 155 160 

ctt cta agt gag ggt gaa agc atc tgg gag atc act gag aaa ata cta 528 
Leu Leu Ser Glu Gly Glu Ser Ile Trp Glu Ile Thr Glu Lys Ile Leu 
165 170 175 

aat teg ttt gag tat acc teg aga ttt aca aaa aca aaa act tta tac 576 
Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr Lys Thr Lys Thr Leu Tyr 
180 : 185 190 

caa ttc etc ttc eta get act ttc ate aat tgt gga aga ttc age gat 624 
Gin Phe Leu Phe Leu Ala Thr Phe He Asn Cys Gly Arg Phe Ser Asp 
195 200 205 

att aag aac gtt gat ccg aaa tea ttt aaa tta gtc caa aat aag tat 672 
He Lys Asn Val Asp Pro Lys Ser Phe Lys Leu Val Gin Asn Lys Tyr 
210 215 220 

ctg gga gta ata ate cag tgt tta gtg aca gag aca aag aca age gtt 720 
Leu Gly Val He He Gin Cys Leu Val Thr Glu Thr Lys Thr Ser Val 
225 230 235 240 

agt agg cac ata tac ttc ttt age gca agg ggt agg ate gat cca ctt 768 
Ser Arg His He Tyr Phe Phe Ser Ala Arg Gly Arg He Asp Pro Leu 
245 250 • 255 

gta tat ttg gat gaa ttt ttg agg aat tct gaa cca gtc eta aaa cga 816 
Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser Glu Pro Val Leu Lys Arg 
260 265 270 

gta aat agg acc ggc aat tct tea age aac aaa cag gaa tac caa tta 864 
Val Asn Arg Thr Gly Asn Ser Ser Ser Asn Lys Gin Glu Tyr Gin Leu 
275 280 285 

tta aaa gat aac tta gtc aga teg tac aac aag get ttg aag aaa aat 912 
Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn Lys Ala Leu Lys Lys Asn 
290 295 300 

gcg cct tat cca ate ttt get ata aag aat ggc cca aaa tct cac att 960 
Ala Pro Tyr Pro He Phe Ala He Lys Asn Gly Pro Lys Ser His He 
305 310 315 320 

gga aga cat ttg atg acc tea ttt ctg tea atg aag ggc eta acg gag 1008 
Gly Arg His Leu Met Thr Ser Phe Leu Ser Met Lys Gly Leu Thr Glu 
325 330 ^ 335 

ttg act aat gtt gtg gga aat tgg age gat aag cgt get tct gee gtg 1056 
Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser Ala Val 
340 345 350 

gee agg aca acg tat act cat cag ata aca gca ata cct gat cac tac 1104 
Ala Arg Thr Thr Tyr Thr His Gin He Thr Ala He Pro Asp His Tyr 
355 360 365 

ttc gca eta' gtt tct egg tac tat gca tat gat cca ata tea aag gaa 1152 
Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr Asp Pro lie Ser Lys Glu 
370 375 380 

atg ata gca ttg aag gat gag act aat cca att gag gag tgg cag cat 1200 
Met He Ala Leu Lys Asp Glu Thr Asn Pro He Glu Glu Trp Gin His 
385 390 395 400 

ata gaa cag eta aag ggt agt get gaa gga age ata cga tac ccc gca 1248 
He Glu Gin Leu Lys Gly Ser Ala Glu Gly .Ser He Arg Tyr Pro Ala 
405 410 ' 415 

tgg aat ggg ata ata tca cag gag gta cta gac tac ctt tca tcc tac 1296 
Trp Asn Gly Ile Ile Ser Gln Glu Val Leu Asp Tyr Leu Ser Ser Tyr 
420 425 430 

ata aat aga cgc ata taatga 1317 
Ile Asn Arg Arg Ile 
435 



<210> 4 
<211> 437 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TAT-Flpe 

<400> 4 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gin Arg Arg Arg Gly Met Ser 
1 5 10 15 

Gin Phe Asp He Leu Cys Lys Thr Pro Pro Lys Val Leu Val Arg Gin 
20 25 30 

Phe Val Glu Arg Phe Glu Arg Pro Ser Gly Glu Lys He Ala Ser Cys 
35 40 45 

Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met He Thr His Asn Gly Thr 
50 55 60 

Ala He Lys Arg Ala Thr Phe Met Ser Tyr Asn Thr He He Ser Asn 
65 70 75 80 

Ser Leu Ser Phe Asp He Val Asn Lys Ser Leu Gin Phe Lys Tyr Lys 
85 90 95 

Thr Gin Lys Ala Thr He Leu Glu Ala Ser Leu Lys Lys Leu He Pro 
100 105 110 

Ala Trp Glu Phe Thr He He Pro Tyr Asn Gly Gin Lys His Gin Ser 
115 120 125 

Asp He Thr Asp He Val Ser Ser Leu Gin Leu Gin Phe Glu Ser Ser 
130 135 140 

Glu Glu Ala Asp Lys Gly Asn Ser His Ser Lys Lys Met Leu Lys Ala 
145 150 155 160 

Leu Leu Ser Glu Gly Glu Ser He Trp Glu lie Thr Glu Lys He Leu 
165 • 170 175 

Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr Lys Thr Lys Thr Leu Tyr 
180 " 185 190 

Gin Phe Leu Phe Leu Ala Thr Phe He Asn Cys Gly Arg Phe Ser Asp 
195 200 205 

He Lys Asn Val Asp Pro Lys Ser Phe Lys Leu Val Gin Asn Lys Tyr 
210 215 220 

Leu Gly- Val He He Gin Cys Leu Val Thr Glu Thr Lys Thr Ser Val 
225 230 235 240 




Ser Arg His lie Tyr Phe Phe Ser Ala Arg Gly Arg He Asp Pro Leu 
245 250 255 

Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser Glu Pro Val Leu Lys Arg 
260 265 270 

Val Asn Arg Thr Gly Asn Ser Ser Ser Asn Lys Gin Glu Tyr Gin Leu 
275 * 280 285 

Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn Lys Ala Leu Lys Lys Asn 
290 295 300 

Ala Pro Tyr Pro He Phe Ala He Lys Asn Gly Pro Lys Ser His He 
305. 310 315 320 

Gly Arg His Leu Met Thr Ser Phe Leu Ser Met Lys Gly Leu Thr Glu 
325 330 335 

Leu Thr Asn Val Val Gly Asn Trp Ser Asp Lys Arg Ala Ser Ala Val 
340 345 350 

Ala Arg Thr Thr Tyr Thr His Gin He Thr Ala He Pro Asp His Tyr 
355 360 365 

Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr Asp Pro He Ser Lys Glu 
370 375 380 

Met He Ala Leu Lys Asp Glu Thr Asn Pro He Glu Glu Trp Gin His 
385 390 395 400 

He Glu Gin Leu Lys Gly Ser Ala Glu Gly Ser He Arg Tyr Pro Ala 
405 410 415 

Trp Asn Gly He He Ser Gin Glu Val Leu Asp Tyr Leu Ser Ser Tyr 
420 425 430 

lie Asn Arg Arg He 
435 

<210> 5 
<211> 2004 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Cre 

<220> 

<221> CDS 

<222> (1)..(2001) 

<400> 5 

atg acc tct cgc cgc tcc gtg aag tcg ggt ccg cgg gag gtt ccg cgc 48 
Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 
1 5 10 15 

gat gag tac gag gat ctg tac tac acc ccg tct tca ggt atg gcg agt 96 
Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
20 25 30 

ccc gat agt ccg cct gac acc tcc cgc cgt ggc gcc cta cag aca cgc 144 
Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg 
35 40 45 

tcg cgc cag agg ggc gag gtc cgt ttc gtc cag tac gac gag tcg gat 192 
Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp 
50 55 60 

tat gcc ctc tac ggg ggc tcg tct tcc gaa gac gac gaa cac ccg gag 240 
Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

gtc ccc cgg acg cgg cgt ccc gtt tcc ggg gcg gtt ttg tcc ggc ccg 288 
Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
85 90 95 

ggg cct gcg cgg gcg cct ccg cca ccc gct ggg tcc gga ggg gcc gga 336 
Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 
100 105 110 

cgc aca ccc acc aca gec ccc egg gee ccc cga acc cag egg gtg gcg 384 
Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gin Arg Val Ala 
115 120 125 

act aag gee ccc gcg gec ccg gcg gcg gag acc acc cgc ggc agg aaa 432 
Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arq Lvs 
130 135 140 

teg gee cag cca gaa tec gec gca etc cca gac gec ccc gcg teg acg 480 
Ser Ala Gin Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr 
145 150 155 160 

gcg cca acc cga tec aag aca ccc gcg cag ggg ctg gee aga aag ctg ' 528 
Ala Pro Thr Arg Ser Lys Thr Pro Ala Gin Gly Leu Ala Arg Lys Leu 
165 170 175 

cac ttt age acc gec ccc cca aac ccc gac gcg cca tgg acc ccc egg 576 
His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Ara 
180 185 190 

gtg gec ggc ttt aac aag cgc gtc ttc tgc gec gcg gtc ggg cgc ctg 624 
Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 
195 200 205 

gcg gec atg cat gee egg atg gcg gcg gtc cag etc tgg gac atg teg 672 
Ala Ala Met His Ala Arg Met Ala Ala Val Gin Leu Trp Asp Met Ser 
210 215 220 

cgt ccg cgc aca gac gaa gac etc aac gaa etc ctt ggc ate acc acc 720 
Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly lie Thr Thr 
225 230 235 240 

ate cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gee aac 768 
lie Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gin Arg Ala Asn 
245 250 255 

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gee acg gcg 816 
Glu Leu Val Asn Pro Asp Val Val Gin Asp Val *Asp Ala Ala Thr Ala ' 
2 ^0 265 270 

act cga ggg cgt tct gcg gcg teg cgc ccc acc gag cga cct cga gec 864 
Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 
275 280 285 

cca gcc cgc tcc gct tct cgc ccc aga cgg ccc gtc gag ggt acc gag 912 
Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
290 295 300 

etc gga tec act agt cca gtg tgg tgg aat tct gca gat ate cag cac 960 
Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp He Gin His 
305 310 315 320 

agt ggc ggc cgc atg tec aat tta ctg acc gta cac caa aat ttg cct 1008 
Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gin Asn Leu Pro 
325 330 335 

gca tta ccg gtc gat gca acg agt gat gag gtt cgc aag aac ctg atg 1056 
Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met 
340 345 350 

gac atg ttc agg gat cgc cag gcg ttt tct gag cat acc tgg aaa atg 1104 
Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met
355 360 365

ctt ctg tcc gtt tgc cgg tcg tgg gcg gca tgg tgc aag ttg aat aac 1152
Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn
370 375 380

cgg aaa tgg ttt ccc gca gaa cct gaa gat gtt cgc gat tat ctt cta 1200
Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu
385 390 395 400

tat ctt cag gcg cgc ggt ctg gca gta aaa act atc cag caa cat ttg 1248
Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu
405 410 415

ggc cag cta aac atg ctt cat cgt cgg tcc ggg ctg cca cga cca agt 1296
Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser
420 425 430

gac agc aat gct gtt tca ctg gtt atg cgg cgg atc cga aaa gaa aac 1344
Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn
435 440 445

gtt gat gcc ggt gaa cgt gca aaa cag gct cta gcg ttc gaa cgc act 1392
Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr
450 455 460

gat ttc gac cag gtt cgt tca ctc atg gaa aat agc gat cgc tgc cag 1440
Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln
465 470 475 480

gat ata cgt aat ctg gca ttt ctg ggg att gct tat aac acc ctg tta 1488
Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu
485 490 495

cgt ata gcc gaa att gcc agg atc agg gtt aaa gat atc tca cgt act 1536
Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr
500 505 510

gac ggt ggg aga atg tta atc cat att ggc aga acg aaa acg ctg gtt 1584
Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val
515 520 525 

agc acc gca ggt gta gag aag gca ctt agc ctg ggg gta act aaa ctg 1632
Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu
530 535 540

gtc gag cga tgg att tcc gtc tct ggt gta gct gat gat ccg aat aac 1680
Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn
545 550 555 560

tac ctg ttt tgc cgg gtc aga aaa aat ggt gtt gcc gcg cca tct gcc 1728
Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala
565 570 575

acc agc cag cta tca act cgc gcc ctg gaa ggg att ttt gaa gca act 1776
Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr
580 585 590

cat cga ttg att tac ggc gct aag gat gac tct ggt cag aga tac ctg 1824
His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu
595 600 605

gcc tgg tct gga cac agt gcc cgt gtc gga gcc gcg cga gat atg gcc 1872
Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala
610 615 620

cgc gct gga gtt tca ata ccg gag atc atg caa gct ggt ggc tgg acc 1920
Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr
625 630 635 640

aat gta aat att gtc atg aac tat atc cgt aac ctg gat agt gaa aca 1968
Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr
645 650 655

ggg gca atg gtg cgc ctg ctg gaa gat ggc gat tag 2004
Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp 
660 665 

<210> 6 
<211> 667 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Cre 

<400> 6 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg
1 5 10 15

Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser
20 25 30

Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg
35 40 45

Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp
50 55 60 

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro
85 90 95 

Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 
100 105 110 

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala
115 120 125

Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys
130 135 140

Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr
145 150 155 160

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
165 170 175

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
180 185 190

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
195 200 205

Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
210 215 220

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
225 230 235 240

Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
245 250 255

Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
260 265 270 

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 
275 280 285 

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
290 295 300

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
305 310 315 320

Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro
325 330 335

Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met
340 345 350

Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met
355 360 365 

Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn 
370 375 380 

Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu 
385 390 395 400 

Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu
405 410 415

Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser
420 425 430

Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn
435 440 445

Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr
450 455 460 




Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln
465 470 475 480

Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu
485 490 495

Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr
500 505 510

Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val
515 520 525

Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu
530 535 540

Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn
545 550 555 560

Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala
565 570 575

Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr
580 585 590

His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu
595 600 605

Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala
610 615 620

Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr
625 630 635 640

Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr
645 650 655 

Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp 
660 665 



<210> 7 
<211> 2247 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22-Flpe 

<220> 

<221> CDS 

<222> (1)..(2241)

<400> 7 

atg acc tct cgc cgc tcc gtg aag tcg ggt ccg cgg gag gtt ccg cgc 48
Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg
1 5 10 15

gat gag tac gag gat ctg tac tac acc ccg tct tca ggt atg gcg agt 96
Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser
20 25 30

ccc gat agt ccg cct gac acc tcc cgc cgt ggc gcc cta cag aca cgc 144
Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg
35 40 45

tcg cgc cag agg ggc gag gtc cgt ttc gtc cag tac gac gag tcg gat 192
Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp
50 55 60

tat gcc ctc tac ggg ggc tcg tct tcc gaa gac gac gaa cac ccg gag 240
Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu
65 70 75 80

gtc ccc cgg acg cgg cgt ccc gtt tcc ggg gcg gtt ttg tcc ggc ccg 288
Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro
85 90 95

ggg cct gcg cgg gcg cct ccg cca ccc gct ggg tcc gga ggg gcc gga 336
Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly
100 105 110

cgc aca ccc acc acc gcc ccc cgg gcc ccc cga acc cag cgg gtg gcg 384
Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala
115 120 125 

act aag gcc ccc gcg gcc ccg gcg gcg gag acc acc cgc ggc agg aaa 432
Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys
130 135 140

tcg gcc cag cca gaa tcc gcc gca ctc cca gac gcc ccc gcg tcg acg 480
Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr
145 150 155 160

gcg cca acc cga tcc aag aca ccc gcg cag ggg ctg gcc aga aag ctg 528
Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
165 170 175

cac ttt agc acc gcc ccc cca aac ccc gac gcg cca tgg acc ccc cgg 576
His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
180 185 190

gtg gcc ggc ttt aac aag cgc gtc ttc tgc gcc gcg gtc ggg cgc ctg 624
Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
195 200 205

gcg gcc atg cat gcc cgg atg gcg gcg gtc cag ctc tgg gac atg tcg 672
Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
210 215 220

cgt ccg cgc aca gac gaa gac ctc aac gaa ctc ctt ggc atc acc acc 720
Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
225 230 235 240

atc cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gcc aac 768
Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
245 250 255

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gcc acg gcg 816
Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
260 265 270

act cga ggg cgt tct gcg gcg tcg cgc ccc acc gag cga cct cga gcc 864
Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
275 280 285

cca gcc cgc tcc gct tct cgc ccc aga cgg ccc gtc gag ggt acc gag 912
Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
290 295 300

ctc gga tcc act agt cca gtg tgg tgg aat tct gca gat atc cag cac 960
Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
305 310 315 320 

agt ggc ggc cgc atg agt caa ttt gat ata tta tgt aaa aca cca cct 1008 
Ser Gly Gly Arg Met Ser Gln Phe Asp Ile Leu Cys Lys Thr Pro Pro
325 330 335

aag gtc ctg gtt cgt cag ttt gtg gaa agg ttt gaa aga cct tca ggg 1056
Lys Val Leu Val Arg Gln Phe Val Glu Arg Phe Glu Arg Pro Ser Gly
340 345 350

gaa aaa ata gca tca tgt gct gct gaa cta acc tat tta tgt tgg atg 1104
Glu Lys Ile Ala Ser Cys Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met
355 360 365

att act cat aac gga aca gca atc aag aga gcc aca ttc atg agc tat 1152
Ile Thr His Asn Gly Thr Ala Ile Lys Arg Ala Thr Phe Met Ser Tyr
370 375 380

aat act atc ata agc aat tcg ctg agt ttc gat att gtc aac aaa tca 1200
Asn Thr Ile Ile Ser Asn Ser Leu Ser Phe Asp Ile Val Asn Lys Ser
385 390 395 400

ctc cag ttt aaa tac aag acg caa aaa gca aca att ctg gaa gcc tca 1248
Leu Gln Phe Lys Tyr Lys Thr Gln Lys Ala Thr Ile Leu Glu Ala Ser
405 410 415

tta aag aaa tta att cct gct tgg gaa ttt aca att att cct tac aat 1296
Leu Lys Lys Leu Ile Pro Ala Trp Glu Phe Thr Ile Ile Pro Tyr Asn
420 425 430

gga caa aaa cat caa tct gat atc act gat att gta agt agt ttg caa 1344
Gly Gln Lys His Gln Ser Asp Ile Thr Asp Ile Val Ser Ser Leu Gln
435 440 445

tta cag ttc gaa tca tcg gaa gaa gca gat aag gga aat agc cac agt 1392
Leu Gln Phe Glu Ser Ser Glu Glu Ala Asp Lys Gly Asn Ser His Ser
450 455 460

aaa aaa atg ctt aaa gca ctt cta agt gag ggt gaa agc atc tgg gag 1440
Lys Lys Met Leu Lys Ala Leu Leu Ser Glu Gly Glu Ser Ile Trp Glu
465 470 475 480

atc act gag aaa ata cta aat tcg ttt gag tat acc tcg aga ttt aca 1488
Ile Thr Glu Lys Ile Leu Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr
485 490 495

aaa aca aaa act tta tac caa ttc ctc ttc cta gct act ttc atc aat 1536
Lys Thr Lys Thr Leu Tyr Gln Phe Leu Phe Leu Ala Thr Phe Ile Asn
500 505 510

tgt gga aga ttc agc gat att aag aac gtt gat ccg aaa tca ttt aaa 1584
Cys Gly Arg Phe Ser Asp Ile Lys Asn Val Asp Pro Lys Ser Phe Lys
515 520 525

tta gtc caa aat aag tat ctg gga gta ata atc cag tgt tta gtg aca 1632
Leu Val Gln Asn Lys Tyr Leu Gly Val Ile Ile Gln Cys Leu Val Thr
530 535 540

gag aca aag aca agc gtt agt agg cac ata tac ttc ttt agc gca agg 1680
Glu Thr Lys Thr Ser Val Ser Arg His Ile Tyr Phe Phe Ser Ala Arg
545 550 555 560

ggt agg atc gat cca ctt gta tat ttg gat gaa ttt ttg agg aat tct 1728
Gly Arg Ile Asp Pro Leu Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser
565 570 575

gaa cca gtc cta aaa cga gta aat agg acc ggc aat tct tca agc aac 1776
Glu Pro Val Leu Lys Arg Val Asn Arg Thr Gly Asn Ser Ser Ser Asn
580 585 590

aaa cag gaa tac caa tta tta aaa gat aac tta gtc aga tcg tac aac 1824
Lys Gln Glu Tyr Gln Leu Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn
595 600 605

aag gct ttg aag aaa aat gcg cct tat cca atc ttt gct ata aag aat 1872
Lys Ala Leu Lys Lys Asn Ala Pro Tyr Pro Ile Phe Ala Ile Lys Asn
610 615 620



ggc cca aaa tct cac att gga aga cat ttg atg acc tca ttt ctg tca 1920
Gly Pro Lys Ser His Ile Gly Arg His Leu Met Thr Ser Phe Leu Ser
625 630 635 640

atg aag ggc cta acg gag ttg act aat gtt gtg gga aat tgg agc gat 1968
Met Lys Gly Leu Thr Glu Leu Thr Asn Val Val Gly Asn Trp Ser Asp
645 650 655

aag cgt gct tct gcc gtg gcc agg aca acg tat act cat cag ata aca 2016
Lys Arg Ala Ser Ala Val Ala Arg Thr Thr Tyr Thr His Gln Ile Thr
660 665 670

gca ata cct gat cac tac ttc gca cta gtt tct cgg tac tat gca tat 2064
Ala Ile Pro Asp His Tyr Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr
675 680 685

gat cca ata tca aag gaa atg ata gca ttg aag gat gag act aat cca 2112
Asp Pro Ile Ser Lys Glu Met Ile Ala Leu Lys Asp Glu Thr Asn Pro
690 695 700

att gag gag tgg cag cat ata gaa cag cta aag ggt agt gct gaa gga 2160
Ile Glu Glu Trp Gln His Ile Glu Gln Leu Lys Gly Ser Ala Glu Gly
705 710 715 720

agc ata cga tac ccc gca tgg aat ggg ata ata tca cag gag gta cta 2208
Ser Ile Arg Tyr Pro Ala Trp Asn Gly Ile Ile Ser Gln Glu Val Leu
725 730 735

gac tac ctt tca tcc tac ata aat aga cgc ata taatga 2247
Asp Tyr Leu Ser Ser Tyr Ile Asn Arg Arg Ile
740 745



<210> 8
<211> 747
<212> PRT

<213> Artificial Sequence

<223> Description of Artificial Sequence: DNA sequence
coding for a fusion protein VP22-Flpe

<400> 8

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg
1 5 10 15

Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser
20 25 30 

Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg
35 40 45

Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp
50 55 60 

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
85 90 95 

Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 
100 105 110 

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala
115 120 125 

Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys 
130 135 140 

Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr
145 150 155 160

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
165 170 175 

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
180 185 190 

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 
195 200 205 

Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
210 215 220

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
225 230 235 240

Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
245 250 255

Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
260 265 270

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
275 280 285 

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
290 295 300 

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
305 310 315 320

Ser Gly Gly Arg Met Ser Gln Phe Asp Ile Leu Cys Lys Thr Pro Pro
325 330 335

Lys Val Leu Val Arg Gln Phe Val Glu Arg Phe Glu Arg Pro Ser Gly
340 345 350

Glu Lys Ile Ala Ser Cys Ala Ala Glu Leu Thr Tyr Leu Cys Trp Met
355 360 365

Ile Thr His Asn Gly Thr Ala Ile Lys Arg Ala Thr Phe Met Ser Tyr
370 375 380

Asn Thr Ile Ile Ser Asn Ser Leu Ser Phe Asp Ile Val Asn Lys Ser
385 390 395 400

Leu Gln Phe Lys Tyr Lys Thr Gln Lys Ala Thr Ile Leu Glu Ala Ser
405 410 415

Leu Lys Lys Leu Ile Pro Ala Trp Glu Phe Thr Ile Ile Pro Tyr Asn
420 425 430

Gly Gln Lys His Gln Ser Asp Ile Thr Asp Ile Val Ser Ser Leu Gln
435 440 445

Leu Gln Phe Glu Ser Ser Glu Glu Ala Asp Lys Gly Asn Ser His Ser
450 455 460

Lys Lys Met Leu Lys Ala Leu Leu Ser Glu Gly Glu Ser Ile Trp Glu
465 470 475 480

Ile Thr Glu Lys Ile Leu Asn Ser Phe Glu Tyr Thr Ser Arg Phe Thr
485 490 495

Lys Thr Lys Thr Leu Tyr Gln Phe Leu Phe Leu Ala Thr Phe Ile Asn
500 505 510

Cys Gly Arg Phe Ser Asp Ile Lys Asn Val Asp Pro Lys Ser Phe Lys
515 520 525

Leu Val Gln Asn Lys Tyr Leu Gly Val Ile Ile Gln Cys Leu Val Thr
530 535 540

Glu Thr Lys Thr Ser Val Ser Arg His Ile Tyr Phe Phe Ser Ala Arg
545 550 555 560

Gly Arg Ile Asp Pro Leu Val Tyr Leu Asp Glu Phe Leu Arg Asn Ser
565 570 575 

Glu Pro Val Leu Lys Arg Val Asn Arg Thr Gly Asn Ser Ser Ser Asn 
580 585 590 

Lys Gln Glu Tyr Gln Leu Leu Lys Asp Asn Leu Val Arg Ser Tyr Asn
595 600 605

Lys Ala Leu Lys Lys Asn Ala Pro Tyr Pro Ile Phe Ala Ile Lys Asn
610 615 620

Gly Pro Lys Ser His Ile Gly Arg His Leu Met Thr Ser Phe Leu Ser
625 630 635 640

Met Lys Gly Leu Thr Glu Leu Thr Asn Val Val Gly Asn Trp Ser Asp
645 650 655

Lys Arg Ala Ser Ala Val Ala Arg Thr Thr Tyr Thr His Gln Ile Thr
660 665 670

Ala Ile Pro Asp His Tyr Phe Ala Leu Val Ser Arg Tyr Tyr Ala Tyr
675 680 685

Asp Pro Ile Ser Lys Glu Met Ile Ala Leu Lys Asp Glu Thr Asn Pro
690 695 700

Ile Glu Glu Trp Gln His Ile Glu Gln Leu Lys Gly Ser Ala Glu Gly
705 710 715 720

Ser Ile Arg Tyr Pro Ala Trp Asn Gly Ile Ile Ser Gln Glu Val Leu
725 730 735

Asp Tyr Leu Ser Ser Tyr Ile Asn Arg Arg Ile
740 745 



<210> 9 
<211> 33 
<212> DNA 

<213> Human immunodeficiency virus 
<400> 9 

tacggccgca agaagcgccg ccaacgccgc cgc 33



<210> 10 
<211> 11 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 10 

Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg
1 5 10



<210> 11 
<211> 42 
<212> DNA 

<213> Human immunodeficiency virus 

<220> 
<221> CDS 
<222> (4)..(42)

<400> 11 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc 42 

Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly
1 5 10



<210> 12 
<211> 13 
<212> PRT 

<213> Human immunodeficiency virus 
<400> 12 

Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly
1 5 10 



<210> 13 
<211> 1623 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Description of Artificial Sequence: DNA sequence
coding for a fusion protein deltaVP22cre-StrepTag

<220>

<221> CDS

<222> (1)..(1617)

<400> 13 

atg gct agc atg act ggt gga cag caa atg ggt cgg gat ccg tcg acg 48
Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Asp Pro Ser Thr
1 5 10 15

gcg cca acc cga tcc aag aca ccc gcg cag ggg ctg gcc aga aag ctg 96
Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
20 25 30

cac ttt agc acc gcc ccc cca aac ccc gac gcg cca tgg acc ccc cgg 144
His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg
35 40 45

gtg gcc ggc ttt aac aag cgc gtc ttc tgc gcc gcg gtc ggg cgc ctg 192
Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu
50 55 60

gcg gcc atg cat gcc cgg atg gcg gct gtc cag ctc tgg gac atg tcg 240
Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
65 70 75 80

cgt ccg cgc aca gac gaa gac ctc aac gaa ctc ctt ggc atc acc acc 288
Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
85 90 95

atc cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gcc aac 336
Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
100 105 110

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gcc acg gcg 384
Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
115 120 125

act cga ggg cgt tct gcg gcg tcg cgc ccc acc gag cga cct cga gcc 432
Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala
130 135 140

cca gcc cgc tcc gct tct cgc ccc aga cgg ccc gtc gag ggt acc gag 480
Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu
145 150 155 160

ctc gga tcc act agt cca gtg tgg tgg aat tct gca gat atc cag cac 528
Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
165 170 175

agt ggc ggc cgc atg tcc aat tta ctg acc gta cac caa aat ttg cct 576
Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro
180 185 190 

gca tta ccg gtc gat gca acg agt gat gag gtt cgc aag aac ctg atg 624 
Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met 
195 200 205 

gac atg ttc agg gat cgc cag gcg ttt tct gag cat acc tgg aaa atg 672
Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met
210 215 220

ctt ctg tcc gtt tgc cgg tcg tgg gcg gca tgg tgc aag ttg aat aac 720
Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn
225 230 235 240

cgg aaa tgg ttt ccc gca gaa cct gaa gat gtt cgc gat tat ctt cta 768
Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu
245 250 255

tat ctt cag gcg cgc ggt ctg gca gta aaa act atc cag caa cat ttg 816
Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu
260 265 270

ggc cag cta aac atg ctt cat cgt cgg tcc ggg ctg cca cga cca agt 864
Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser
275 280 285

gac agc aat gct gtt tca ctg gtt atg cgg cgg atc cga aaa gaa aac 912
Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn
290 295 300

gtt gat gcc ggt gaa cgt gca aaa cag gct cta gcg ttc gaa cgc act 960
Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr
305 310 315 320

gat ttc gac cag gtt cgt tca ctc atg gaa aat agc gat cgc tgc cag 1008
Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln
325 330 335

gat ata cgt aat ctg gca ttt ctg ggg att gct tat aac acc ctg tta 1056
Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu
340 345 350

cgt ata gcc gaa att gcc agg atc agg gtt aaa gat atc tca cgt act 1104
Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr
355 360 365

gac ggt ggg aga atg tta atc cat att ggc aga acg aaa acg ctg gtt 1152
Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val
370 375 380

agc acc gca ggt gta gag aag gca ctt agc ctg ggg gta act aaa ctg 1200
Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu
385 390 395 400

gtc gag cga tgg att tcc gtc tct ggt gta gct gat gat ccg aat aac 1248
Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn
405 410 415

tac ctg ttt tgc cgg gtc aga aaa aat ggt gtt gcc gcg cca tct gcc 1296
Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala
420 425 430

acc agc cag cta tca act cgc gcc ctg gaa ggg att ttt gaa gca act 1344
Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr
435 440 445

cat cga ttg att tac ggc gct aag gat gac tct ggt cag aga tac ctg 1392
His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu
450 455 460

gcc tgg tct gga cac agt gcc cgt gtc gga gcc gcg cga gat atg gcc 1440
Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala
465 470 475 480

cgc gct gga gtt tca ata ccg gag atc atg caa gct ggt ggc tgg acc 1488
Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr
485 490 495

aat gta aat att gtc atg aac tat atc cgt aac ctg gat agt gaa aca 1536
Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr
500 505 510

ggg gca atg gtg cgc ctg ctg gaa gat ggc gat ggt atc gaa ggt cgt 1584
Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg
515 520 525

ggt agc gct tgg cgt cac ccg cag ttc ggt ggt taataa 1623
Gly Ser Ala Trp Arg His Pro Gln Phe Gly Gly
530 535 



<210> 14 
<211> 539 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein deltaVP22cre-StrepTag 

<400> 14 

Met Ala Ser Met Thr Gly Gly Gln Gln Met Gly Arg Asp Pro Ser Thr
1 5 10 15

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu
20 25 30 

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
35 40 45 

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 
50 55 60 

Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser
65 70 75 80

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr
85 90 95

Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn
100 105 110

Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala
115 120 125

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 
130 135 140 

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
145 150 155 160 

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His
165 170 175

Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro
180 185 190

Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met
195 200 205

Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met
210 215 220 

Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn 
225 230 235 240 

Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu 
245 250 255 

Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu
260 265 270

Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser
275 280 285

Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn
290 295 300

Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr
305 310 315 320

Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln
325 330 335

Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu
340 345 350

Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr
355 360 365

Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val
370 375 380 

Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu 
385 390 395 400 

Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn
405 410 415 

Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala 
420 425 430 

Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr
435 440 445

His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu
450 455 460 

Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala 
465 470 475 480 

Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr
485 490 495

Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr
500 505 510

Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg
515 520 525

Gly Ser Ala Trp Arg His Pro Gln Phe Gly Gly
530 535



<210> 15
<211> 5953 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pCRT7-deltaVPCS 

<400> 15 

cgatggattt ccgtctctgg tgtagctgat gatccgaata actacctgtt ttgccgggtc 60 
agaaaaaatg gtgttgccgc gccatctgcc accagccagc tatcaactcg cgccctggaa 120 
gggatttttg aagcaactca tcgattgatt tacggcgcta aggatgactc tggtcagaga 180 
tacctggcct ggtctggaca cagtgcccgt gtcggagccg cgcgagatat ggcccgcgct 240 
ggagtttcaa taccggagat catgcaagct ggtggctgga ccaatgtaaa tattgtcatg 300 
aactatatcc gtaacctgga tagtgaaaca ggggcaatgg tgcgcctgct ggaagatggc 360 
gatggtatcg aaggtcgtgg tagcgcttgg cgtcacccgc agttcggtgg ttaataagct 420 
tcgaacaaaa actcatctca gaagaggatc tgaatatgca taccggtcat catcaccatc 480 
accattgagt tttgagcaat aactagcata accccttggg gcctctaaac gggtcttgag 540 
gggttttttg ccgaaaggag gaactatatc cggatatcca caggacgggt gtggtcgcca 600
tgatcgcgta gtcgatagtg gctccaagta gcgaagcgag caggactggg cggcggccaa 660 
agcggtcgga cagtgctccg agaacgggtg cgcatagaaa ttgcatcaac gcatatagcg 720 
ctagcagcac gccatagtga ctggcgatgc tgtcggaatg gacgatatcc cgcaagaggc 780 
ccggcagtac cggcataacc aagcctatgc ctacagcatc cagggtgacg gtgccgagga 840
tgacgatgag cgcattgtta gatttcatac acggtgcctg actgcgttag caatttaact 900 
gtgataaact accgcattaa agcttatcga tgataagctg tcaaacatga gaattaattc 960 
ttagaaaaac tcatcgagca tcaaatgaaa ctgcaattta ttcatatcag gattatcaat 1020 
accatatttt tgaaaaagcc gtttctgtaa tgaaggagaa aactcaccga ggcagttcca 1080 
taggatggca agatcctggt atcggtctgc gattccgact cgtccaacat caatacaacc 1140
tattaatttc ccctcgtcaa aaataaggtt atcaagtgag aaatcaccat gagtgacgac 1200 
tgaatccggt gagaatggca aaagcttatg catttctttc cagacttgtt caacaggcca 1260 
gccattacgc tcgtcatcaa aatcactcgc atcaaccaaa ccgttattca ttcgtgattg 1320 
cgcctgagcg agacgaaata cgcgatcgct gttaaaagga caattacaaa caggaatcga 1380 
atgcaaccgg cgcaggaaca ctgccagcgc atcaacaata ttttcacctg aatcaggata 1440 
ttcttctaat acctggaatg ctgttttccc ggggatcgca gtggtgagta accatgcatc 1500 
atcaggagta cggataaaat gcttgatggt cggaagaggc ataaattccg tcagccagtt 1560 
tagtctgacc atctcatctg taacatcatt ggcaacgcta cctttgccat gtttcagaaa 1620 
caactctggc gcatcgggct tcccatacaa tcgatagatt gtcgcacctg attgcccgac 1680 
attatcgcga gcccatttat acccatataa atcagcatcc atgttggaat ttaatcgcgg 1740
cctcgagcaa gacgtttccc gttgaatatg gctcataaca ccccttgtat tactgtttat 1800 
gtaagcagac agttttattg ttcatgacca aaatccctta acgtgagttt tcgttccact 1860
gagcgtcaga ccccgtagaa aagatcaaag gatcttcttg agatcctttt tttctgcgcg 1920 
taatctgctg cttgcaaaca aaaaaaccac cgctaccagc ggtggtttgt ttgccggatc 1980 
aagagctacc aactcttttt ccgaaggtaa ctggcttcag cagagcgcag ataccaaata 2040 
ctgtccttct agtgtagccg tagttaggcc accacttcaa gaactctgta gcaccgccta 2100 
catacctcgc tctgctaatc ctgttaccag tggctgctgc cagtggcgat aagtcgtgtc 2160 
ttaccgggtt ggactcaaga cgatagttac cggataaggc gcagcggtcg ggctgaacgg 2220 
ggggttcgtg cacacagccc agcttggagc gaacgaccta caccgaactg agatacctac 2280
agcgtgagct atgagaaagc gccacgcttc ccgaagggag aaaggcggac aggtatccgg 2340 
taagcggcag ggtcggaaca ggagagcgca cgagggagct tccaggggga aacgcctggt 2400 
atctttatag tcctgtcggg tttcgccacc tctgacttga gcgtcgattt ttgtgatgct 2460
cgtcaggggg gcggagccta tggaaaaacg ccagcaacgc ggccttttta cggttcctgg 2520 
ccttttgctg gccttttgct cacatgttct ttcctgcgtt atcccctgat tctgtggata 2580 
accgtattac cgcctttgag tgagctgata ccgctcgccg cagccgaacg accgagcgca 2640 
gcgagtcagt gagcgaggaa gcggaagagc gcctgatgcg gtattttctc cttacgcatc 2700 
tgtgcggtat ttcacaccgc atatatggtg cactctcagt acaatctgct ctgatgccgc 2760
atagttaagc cagtatacac tccgctatcg ctacgtgact gggtcatggc tgcgccccga 2820
cacccgccaa cacccgctga cgcgccctga cgggcttgtc tgctcccggc atccgcttac 2880 
agacaagctg tgaccgtctc cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg 2940 
aaacgcgcga ggcagctgcg gtaaagctca tcagcgtggt cgtgaagcga ttcacagatg 3000 
tctgcctgtt catccgcgtc cagctcgttg agtttctcca gaagcgttaa tgtctggctt 3060
ctgataaagc gggccatgtt aagggcggtt ttttcctgtt tggtcactga tgcctccgtg 3120 
taagggggat ttctgttcat gggggtaatg ataccgatga aacgagagag gatgctcacg 3180 
atacgggtta ctgatgatga acatgcccgg ttactggaac gttgtgaggg taaacaactg 3240 
gcggtatgga tgcggcggga ccagagaaaa atcactcagg gtcaatgcca gcgcttcgtt 3300 
aatacagatg taggtgttcc acagggtagc cagcagcatc ctgcgatgca gatccggaac 3360 
ataatggtgc agggcgctga cttccgcgtt tccagacttt acgaaacacg gaaaccgaag 3420 
accattcatg ttgttgctca ggtcgcagac gttttgcagc agcagtcgct tcacgttcgc 3480 
tcgcgtatcg gtgattcatt ctgctaacca gtaaggcaac cccgccagcc tagccgggtc 3540 
ctcaacgaca ggagcacgat catgcgcacc cgtggccagg acccaacgct gcccgagatg 3600 
cgccgcgtgc ggctgctgga gatggcggac gcgatggata tgttctgcca agggttggtt 3660 
tgcgcattca cagttctccg caagaattga ttggctccaa ttcttggagt ggtgaatccg 3720 
ttagcgaggt gccgccggct tccattcagg tcgaggtggc ccggctccat gcaccgcgac 3780 
gcaacgcggg gaggcagaca aggtataggg cggcgcctac aatccatgcc aacccgttcc 3840 
atgtgctcgc cgaggcggca taaatcgccg tgacgatcag cggtccagtg atcgaagtta 3900 
ggctggtaag agccgcgagc gatccttgaa gctgtccctg atggtcgtca tctacctgcc 3960 
tggacagcat ggcctgcaac gcgggcatcc cgatgccgcc ggaagcgaga agaatcataa 4020 
tggggaaggc catccagcct cgcgtcgcga acgccagcaa gacgtagccc agcgcgtcgg 4080 
ccgccatgcc ggcgataatg gcctgcttct cgccgaaacg tttggtggcg ggaccagtga 4140 
cgaaggcttg agcgagggcg tgcaagattc cgaataccgc aagcgacagg ccgatcatcg 4200 
tcgcgctcca gcgaaagcgg tcctcgccga aaatgaccca gagcgctgcc ggcacctgtc 4260 
ctacgagttg catgataaag aagacagtca taagtgcggc gacgatagtc atgccccgcg 4320 
cccaccggaa ggagctgact gggttgaagg ctctcaaggg catcggtcga cgctctccct 4380 
tatgcgactc ctgcattagg aagcagccca gtagtaggtt gaggccgttg agcaccgccg 4440 
ccgcaaggaa tggtgcatgc aaggagatgg cgcccaacag tcccccggcc acggggcctg 4500 
ccaccatacc cacgccgaaa caagcgctca tgagcccgaa gtggcgagcc cgatcttccc 4560 
catcggtgat gtcggcgata taggcgccag caaccgcacc tgtggcgccg gtgatgccgg 4620 
ccacgatgcg tccggcgtag aggatcgaga tctcgatccc gcgaaattaa tacgactcac 4680 
tatagggaga ccacaacggt ttccctctag aaataatttt gtttaacttt aagaaggaga 4740 
tatacatatg gctagcatga ctggtggaca gcaaatgggt cgggatccgt cgacggcgcc 4800 
aacccgatcc aagacacccg cgcaggggct ggccagaaag ctgcacttta gcaccgcccc 4860 
cccaaacccc gacgcgccat ggaccccccg ggtggccggc tttaacaagc gcgtcttctg 4920 
cgccgcggtc gggcgcctgg cggccatgca tgcccggatg gcggctgtcc agctctggga 4980 
catgtcgcgt ccgcgcacag acgaagacct caacgaactc cttggcatca ccaccatccg 5040 
cgtgacggtc tgcgagggca aaaacctgct tcagcgcgcc aacgagttgg tgaatccaga 5100 
cgtggtgcag gacgtcgacg cggccacggc gactcgaggg cgttctgcgg cgtcgcgccc 5160 
caccgagcga cctcgagccc cagcccgctc cgcttctcgc cccagacggc ccgtcgaggg 5220 
taccgagctc ggatccacta gtccagtgtg gtggaattct gcagatatcc agcacagtgg 5280 
cggccgcatg tccaatttac tgaccgtaca ccaaaatttg cctgcattac cggtcgatgc 5340 
aacgagtgat gaggttcgca agaacctgat ggacatgttc agggatcgcc aggcgttttc 5400 
tgagcatacc tggaaaatgc ttctgtccgt ttgccggtcg tgggcggcat ggtgcaagtt 5460 
gaataaccgg aaatggtttc ccgcagaacc tgaagatgtt cgcgattatc ttctatatct 5520 
tcaggcgcgc ggtctggcag taaaaactat ccagcaacat ttgggccagc taaacatgct 5580 
tcatcgtcgg tccgggctgc cacgaccaag tgacagcaat gctgtttcac tggttatgcg 5640 
gcggatccga aaagaaaacg ttgatgccgg tgaacgtgca aaacaggctc tagcgttcga 5700 
acgcactgat ttcgaccagg ttcgttcact catggaaaat agcgatcgct gccaggatat 5760 
acgtaatctg gcatttctgg ggattgctta taacaccctg ttacgtatag ccgaaattgc 5820 
caggatcagg gttaaagata tctcacgtac tgacggtggg agaatgttaa tccatattgg 5880 
cagaacgaaa acgctggtta gcaccgcagg tgtagagaag gcacttagcc tgggggtaac 5940 
taaactggtc gag 5953 



<210> 16 
<211> 4727 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pT7-TACS 

<400> 16 

atccggatat agttcctcct ttcagcaaaa aacccctcaa gacccgttta gaggccccaa 60 

ggggttatgc tagttattgc tcagcggtgg cagcagccaa ctcagcttcc tttcgggctt 120 

tgttagcagc cggatctcag tggtggtggt ggtggtgctc gagtgcggcc gcaagcttat 180 

taaccaccga actgcgggtg acgccaagcg ctaccacgac cttcgatacc atcgccatct 240 

tccagcaggc gcaccattgc ccctgtttca ctatccaggt tacggatata gttcatgaca 300 

atatttacat tggtccagcc accagcttgc atgatctccg gtattgaaac tccagcgcgg 360 

gccatatctc gcgcggctcc gacacgggca ctgtgtccag accaggccag gtatctctga 420 

ccagagtcat ccttagcgcc gtaaatcaat cgatgagttg cttcaaaaat cccttccagg 480 

gcgcgagttg atagctggct ggtggcagat ggcgcggcaa caccattttt tctgacccgg 540 

caaaacaggt agttattcgg atcatcagct acaccagaga cggaaatcca tcgctcgacc 600 

agtttagtta cccccaggct aagtgccttc tctacacctg cggtgctaac cagcgttttc 660 

gttctgccaa tatggattaa cattctccca ccgtcagtac gtgagatatc tttaaccctg 720 

atcctggcaa tttcggctat acgtaacagg gtgttataag caatccccag aaatgccaga 780 

ttacgtatat cctggcagcg atcgctattt tccatgagtg aacgaacctg gtcgaaatca 840 

gtgcgttcga acgctagagc ctgttttgca cgttcaccgg catcaacgtt ttcttttcgg 900 

atccgccgca taaccagtga aacagcattg ctgtcacttg gtcgtggcag cccggaccga 960 

cgatgaagca tgtttagctg gcccaaatgt tgctggatag tttttactgt cagaccgcgc 1020 

gcctgaagat atagaagata atcgcgaaca tcttcaggtt ctgcgggaaa ccatttccgg 1080 

ttattcaact tgcaccatgc cgcccacgac cggcaaacgg acagaagcat tttccaggta 1140 

tgctcagaaa acgcctggcg atccctgaac atgtccatca ggttcttgcg aacctcatca 1200 

ctcgttgcat cgaccggtaa tgcaggcaaa ttttggtgta cggtcagtaa attggacatg 1260 

ccgcggcggc gttggcggcg cttcttgcgg ccgtagccca tggtatatct ccttcttaaa 1320 

gttaaacaaa attatttcta gagggaaacc gttgtggtct ccctatagtg agtcgtatta 1380 

atttcgcggg atcgagatct cgggcagcgt tgggtcctgg ccacgggtgc gcatgatcgt 1440 

gctcctgtcg ttgaggaccc ggctaggctg gcggggttgc cttactggtt agcagaatga 1500 

atcaccgata cgcgagcgaa cgtgaagcga ctgctgctgc aaaacgtctg cgacctgagc 1560 

aacaacatga atggtcttcg gtttccgtgt ttcgtaaagt ctggaaacgc ggaagtcagc 1620 

gccctgcacc attatgttcc ggatctgcat cgcaggatgc tgctggctac cctgtggaac 1680 

acctacatct gtattaacga agcgctggca ttgaccctga gtgatttttc tctggtcccg 1740 

ccgcatccat accgccagtt gtttaccctc acaacgttcc agtaaccggg catgttcatc 1800 

atcagtaacc cgtatcgtga gcatcctctc tcgtttcatc ggtatcatta cccccatgaa 1860 

cagaaatccc ccttacacgg aggcatcagt gaccaaacag gaaaaaaccg cccttaacat 1920 

ggcccgcttt atcagaagcc agacattaac gcttctggag aaactcaacg agctggacgc 1980 

ggatgaacag gcagacatct gtgaatcgct tcacgaccac gctgatgagc tttaccgcag 2040 

ctgcctcgcg cgtttcggtg atgacggtga aaacctctga cacatgcagc tcccggagac 2100 

ggtcacagct tgtctgtaag cggatgccgg gagcagacaa gcccgtcagg gcgcgtcagc 2160 

gggtgttggc gggtgtcggg gcgcagccat gacccagtca cgtagcgata gcggagtgta 2220 

tactggctta actatgcggc atcagagcag attgtactga gagtgcacca tatatgcggt 2280 

gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgctct tccgcttcct 2340 

cgctcactga ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca gctcactcaa 2400 

aggcggtaat acggttatcc acagaatcag gggataacgc aggaaagaac atgtgagcaa 2460 

aaggccagca aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt ttccataggc 2520 

tccgcccccc tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg cgaaacccga 2580 

caggactata aagataccag gcgtttcccc ctggaagctc cctcgtgcgc tctcctgttc 2640 

cgaccctgcc gcttaccgga tacctgtccg cctttctccc ttcgggaagc gtggcgcttt 2700 

ctcatagctc acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc aagctgggct 2760 

gtgtgcacga accccccgtt cagcccgacc gctgcgcctt atccggtaac tatcgtcttg 2820 

agtccaaccc ggtaagacac gacttatcgc cactggcagc agccactggt aacaggatta 2880 

gcagagcgag gtatgtaggc ggtgctacag agttcttgaa gtggtggcct aactacggct 2940 

acactagaag gacagtattt ggtatctgcg ctctgctgaa gccagttacc ttcggaaaaa 3000 

gagttggtag ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt ttttttgttt 3060 

gcaagcagca gattacgcgc agaaaaaaag gatctcaaga agatcctttg atcttttcta 3120 

cggggtctga cgctcagtgg aacgaaaact cacgttaagg gattttggtc atgagattat 3180 

caaaaaggat cttcacctag atccttttaa attaaaaatg aagttttaaa tcaatctaaa 3240 

gtatatatga gtaaacttgg tctgacagtt accaatgctt aatcagtgag gcacctatct 3300 

cagcgatctg tctatttcgt tcatccatag ttgcctgact ccccgtcgtg tagataacta 3360 

cgatacggga gggcttacca tctggcccca gtgctgcaat gataccgcga gacccacgct 3420 

caccggctcc agatttatca gcaataaacc agccagccgg aagggccgag cgcagaagtg 3480 

gtcctgcaac tttatccgcc tccatccagt ctattaattg ttgccgggaa gctagagtaa 3540 

gtagttcgcc agttaatagt ttgcgcaacg ttgttgccat tgctgcaggc atcgtggtgt 3600 

cacgctcgtc gtttggtatg gcttcattca gctccggttc ccaacgatca aggcgagtta 3660 

catgatcccc catgttgtgc aaaaaagcgg ttagctcctt cggtcctccg atcgttgtca 3720 

gaagtaagtt ggccgcagtg ttatcactca tggttatggc agcactgcat aattctctta 3780 

ctgtcatgcc atccgtaaga tgcttttctg tgactggtga gtactcaacc aagtcattct 3840 

gagaatagtg tatgcggcga ccgagttgct cttgcccggc gtcaatacgg gataataccg 3900 

cgccacatag cagaacttta aaagtgctca tcattggaaa acgttcttcg gggcgaaaac 3960 

tctcaaggat cttaccgctg ttgagatcca gttcgatgta acccactcgt gcacccaact 4020 

gatcttcagc atcttttact ttcaccagcg tttctgggtg agcaaaaaca ggaaggcaaa 4080 

atgccgcaaa aaagggaata agggcgacac ggaaatgttg aatactcata ctcttccttt 4140 

ttcaatatta ttgaagcatt tatcagggtt attgtctcat gagcggatac atatttgaat 4200 

gtatttagaa aaataaacaa ataggggttc cgcgcacatt tccccgaaaa gtgccacctg 4260 

aaattgtaaa cgttaatatt ttgttaaaat tcgcgttaaa tttttgttaa atcagctcat 4320 

tttttaacca ataggccgaa atcggcaaaa tcccttataa atcaaaagaa tagaccgaga 4380 

tagggttgag tgttgttcca gtttggaaca agagtccact attaaagaac gtggactcca 4440 

acgtcaaagg gcgaaaaacc gtctatcagg gcgatggccc actacgtgaa ccatcaccct 4500 

aatcaagttt tttggggtcg aggtgccgta aagcactaaa tcggaaccct aaagggagcc 4560 

cccgatttag agcttgacgg ggaaagccgg cgaacgtggc gagaaaggaa gggaagaaag 4620 

cgaaaggagc gggcgctagg gcgctggcaa gtgtagcggt cacgctgcgc gtaaccacca 4680 

cacccgccgc gcttaatgcg ccgctacagg gcgcgtccca ttcgcca 4727 



<210> 17 
<211> 4488 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pT7-VPCS 



<400> 17 

aaatcaatct aaagtatata tgagtaaact tggtctgaca gttaccaatg cttaatcagt 60 
gaggcaccta tctcagcgat ctgtctattt cgttcatcca tagttgcctg actccccgtc 120 
gtgtagataa ctacgatacg ggagggctta ccatctggcc ccagtgctgc aatgataccg 180 
cgagacccac gctcaccggc tccagattta tcagcaataa accagccagc cggaagggcc 240 
gagcgcagaa gtggtcctgc aactttatcc gcctccatcc agtctattaa ttgttgccgg 300 
gaagctagag taagtagttc gccagttaat agtttgcgca acgttgttgc cattgctaca 360 
ggcatcgtgg tgtcacgctc gtcgtttggt atggcttcat tcagctccgg ttcccaacga 420 
tcaaggcgag ttacatgatc ccccatgttg tgcaaaaaag cggttagctc cttcggtcct 480 
ccgatcgttg tcagaagtaa gttggccgca gtgttatcac tcatggttat ggcagcactg 540 
cataattctc ttactgtcat gccatccgta agatgctttt ctgtgactgg tgagtactca 600 
accaagtcat tctgagaata gtgtatgcgg cgaccgagtt gctcttgccc ggcgtcaaca 660 
cgggataata ccgcgccaca tagcagaact ttaaaagtgc tcatcattgg aaaacgttct 720 
tcggggcgaa aactctcaag gatcttaccg ctgttgagat ccagttcgat gtaacccact 780 
cgtgcaccca actgatcttc agcatctttt actttcacca gcgtttctgg gtgagcaaaa 840 
acaggaaggc aaaatgccgc aaaaaaggga ataagggcga cacggaaatg ttgaatactc 900 
atactcttcc tttttcaata ttattgaagc atttatcagg gttattgtct catgagcgga 960 
tacatatttg aatgtattta gaaaaataaa caaatagggg ttccgcgcac atttccccga 1020 
aaagtgccac ctgacgtcta agaaaccatt attatcatga cattaaccta taaaaatagg 1080 
cgtatcacga ggccctttcg tcttcaagaa ttaaaaggat ctaggtgaag atcctttttg 1140 
ataatctcat gaccaaaatc ccttaacgtg agttttcgtt ccactgagcg tcagaccccg 1200 
tagaaaagat caaaggatct tcttgagatc ctttttttct gcgcgtaatc tgctgcttgc 1260 
aaacaaaaaa accaccgcta ccagcggtgg tttgtttgcc ggatcaagag ctaccaactc 1320 
tttttccgaa ggtaactggc ttcagcagag cgcagatacc aaatactgtc cttctagtgt 1380 
agccgtagtt aggccaccac ttcaagaact ctgtagcacc gcctacatac ctcgctctgc 1440 
taatcctgtt accagtggct gctgccagtg gcgataagtc gtgtcttacc gggttggact 1500 
caagacgata gttaccggat aaggcgcagc ggtcgggctg aacggggggt tcgtgcacac 1560 
agcccagctt ggagcgaacg acctacaccg aactgagata cctacagcgt gagctatgag 1620 
aaagcgccac gcttcccgaa gggagaaagg cggacaggta tccggtaagc ggcagggtcg 1680 
gaacaggaga gcgcacgagg gagcttccag ggggaaacgc ctggtatctt tatagtcctg 1740 
tcgggtttcg ccacctctga cttgagcgtc gatttttgtg atgctcgtca ggggggcgga 1800 
gcctatggaa aaacgccagc aacgcggcct ttttacggtt cctggccttt tgctggcctt 1860 
ttgctcacat gttctttcct gcgttatccc ctgattctgt ggataaccgt attaccgcct 1920 
ttgagtgagc tgataccgct cgccgcagcc gaacgaccga gcgcagcgag tcagtgagcg 1980 
aggaagcgga agagcgcctg atgcggtatt ttctccttac gcatctgtgc ggtatttcac 2040 
accgcatcag atctgatggt gcactctcag tacaatctgc tctgatgccg catagttaag 2100 
ccagtatata cactccgcta tcgctacgtg actgggtcat ggctgcgccc cgacacccgc 2160 
caacacccgc tgacgcgccc tgacgggctt gtctgctccc ggcatccgct tacagacaag 2220 
ctgtgaccgt ctccgggagc tgcatgtgtc agaggttttc accgtcatca ccgaaacgcg 2280 
cgaggcccag cgattcgaac ttctgataga cttcgaaatt aatacgactc actataggga 2340 
gaccacaacg gtttccctct agaaataatt ttgtttaact ttaagaagga gatatacata 2400 
tgacctctcg ccgctccgtg aagtcgggtc cgcgggaggt tccgcgcgat gagtacgagg 2460 
atctgtacta caccccgtct tcaggtatgg cgagtcccga tagtccgcct gacacctccc 2520 
gccgtggcgc cctacagaca cgctcgcgcc agaggggcga ggtccgtttc gtccagtacg 2580 
acgagtcgga ttatgccctc tacgggggct cgtcttccga agacgacgaa cacccggagg 2640 
tcccccggac gcggcgtccc gtttccgggg cggttttgtc cggcccgggg cctgcgcggg 2700 
cgcctccgcc acccgctggg tccggagggg ccggacgcac acccaccacc gccccccggg 2760 
ccccccgaac ccagcgggtg gcgtctaagg cccccgcggc cccggcggcg gagaccaccc 2820 
gcggcaggaa atcggcccag ccagaatccg ccgcactccc agacgccccc gcgtcgacgg 2880 
cgccaacccg atccaagaca cccgcgcagg ggctggccag aaagctgcac tttagcaccg 2940 
cccccccaaa ccccgacgcg ccatggaccc cccgggtggc cggctttaac aagcgcgtct 3000 
tctgcgccgc ggtcgggcgc ctggcggcca tgcatgcccg gatggcggct gtccagctct 3060 
gggacatgtc gcgtccgcgc acagacgaag acctcaacga actccttggc atcaccacca 3120 
tccgcgtgac ggtctgcgag ggcaaaaacc tgcttcagcg cgccaacgag ttggtgaatc 3180 
cagacgtggt gcaggacgtc gacgcggcca cggcgactcg agggcgttct gcggcgtcgc 3240 
gccccaccga gcgacctcga gccccagccc gctccgcttc tcgccccaga cggcccgtcg 3300 
agggtaccga gctcggatcc actagtccag tgtggtggaa ttctgcagat atccagcaca 3360 
gtggcggccg catgtccaat ttactgaccg tacaccaaaa tttgcctgca ttaccggtcg 3420 
atgcaacgag tgatgaggtt cgcaagaacc tgatggacat gttcagggat cgccaggcgt 3480 
tttctgagca tacctggaaa atgcttctgt ccgtttgccg gtcgtgggcg gcatggtgca 3540 
agttgaataa ccggaaatgg tttcccgcag aacctgaaga tgttcgcgat tatcttctat 3600 
atcttcaggc gcgcggtctg gcagtaaaaa ctatccagca acatttgggc cagctaaaca 3660 
tgcttcatcg tcggtccggg ctgccacgac caagtgacag caatgctgtt tcactggtta 3720 
tgcggcggat ccgaaaagaa aacgttgatg ccggtgaacg tgcaaaacag gctctagcgt 3780 
tcgaacgcac tgatttcgac caggttcgtt cactcatgga aaatagcgat cgctgccagg 3840 
atatacgtaa tctggcattt ctggggattg cttataacac cctgttacgt atagccgaaa 3900 
ttgccaggat cagggttaaa gatatctcac gtactgacgg tgggagaatg ttaatccata 3960 
ttggcagaac gaaaacgctg gttagcaccg caggtgtaga gaaggcactt agcctggggg 4020 
taactaaact ggtcgagcga tggatttccg tctctggtgt agctgatgat ccgaataact 4080 
acctgttttg ccgggtcaga aaaaatggtg ttgccgcgcc atctgccacc agccagctat 4140 
caactcgcgc cctggaaggg atttttgaag caactcatcg attgatttac ggcgctaagg 4200 
atgactctgg tcagagatac ctggcctggt ctggacacag tgcccgtgtc ggagccgcgc 4260 
gagatatggc ccgcgctgga gtttcaatac cggagatcat gcaagctggt ggctggacca 4320 
atgtaaatat tgtcatgaac tatatccgta acctggatag tgaaacaggg gcaatggtgc 4380 
gcctgctgga agatggcgat ggtatcgaag gtcgtggtag cgcttggcgt cacccgcagt 4440 
tcggtggtta ataagcttat cgatgataag ctgtcaaaca tgagaatt 4488 



<210> 18 
<211> 1125 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TATcreStrepTag 

<220> 

<221> CDS 

<222> (1)..(1119) 

<400> 18 

atg ggc tac ggc cgc aag aag cgc cgc caa cgc cgc cgc ggc atg tcc 48 
Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser 
1 5 10 15 

aat tta ctg acc gta cac caa aat ttg cct gca tta ccg gtc gat gca 96 
Asn Leu Leu Thr Val His Gln Asn Leu Pro Ala Leu Pro Val Asp Ala 
20 25 30 

acg agt gat gag gtt cgc aag aac ctg atg gac atg ttc agg gat cgc 144 
Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg 
35 40 45 

cag gcg ttt tct gag cat acc tgg aaa atg ctt ctg tcc gtt tgc cgg 192 
Gln Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg 
50 55 60 

tcg tgg gcg gca tgg tgc aag ttg aat aac cgg aaa tgg ttt ccc gca 240 
Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala 
65 70 75 80 

gaa cct gaa gat gtt cgc gat tat ctt cta tat ctt cag gcg cgc ggt 288 
Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala Arg Gly 
85 90 95 

ctg aca gta aaa act atc cag caa cat ttg ggc cag cta aac atg ctt 336 
Leu Thr Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn Met Leu 
100 105 110 

cat cgt cgg tcc ggg ctg cca cga cca agt gac agc aat gct gtt tca 384 
His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser 
115 120 125 

ctg gtt atg cgg cgg atc cga aaa gaa aac gtt gat gcc ggt gaa cgt 432 
Leu Val Met Arg Arg Ile Arg Lys Glu Asn Val Asp Ala Gly Glu Arg 
130 135 140 

gca aaa cag gct cta gcg ttc gaa cgc act gat ttc gac cag gtt cgt 480 
Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gln Val Arg 
145 150 155 160 

tca ctc atg gaa aat agc gat cgc tgc cag gat ata cgt aat ctg gca 528 
Ser Leu Met Glu Asn Ser Asp Arg Cys Gln Asp Ile Arg Asn Leu Ala 
165 170 175 

ttt ctg ggg att gct tat aac acc ctg tta cgt ata gcc gaa att gcc 576 
Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu Arg Ile Ala Glu Ile Ala 
180 185 190 

agg atc agg gtt aaa gat atc tca cgt act gac ggt ggg aga atg tta 624 
Arg Ile Arg Val Lys Asp Ile Ser Arg Thr Asp Gly Gly Arg Met Leu 
195 200 205 

atc cat att ggc aga acg aaa acg ctg gtt agc acc gca ggt gta gag 672 
Ile His Ile Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu 
210 215 220 

aag gca ctt agc ctg ggg gta act aaa ctg gtc gag cga tgg att tcc 720 
Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp Ile Ser 
225 230 235 240 

gtc tct ggt gta gct gat gat ccg aat aac tac ctg ttt tgc cgg gtc 768 
Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val 
245 250 255 

aga aaa aat ggt gtt gcc gcg cca tct gcc acc agc cag cta tca act 816 
Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gln Leu Ser Thr 
260 265 270 

cgc gcc ctg gaa ggg att ttt gaa gca act cat cga ttg att tac ggc 864 
Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr His Arg Leu Ile Tyr Gly 
275 280 285 

gct aag gat gac tct ggt cag aga tac ctg gcc tgg tct gga cac agt 912 
Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu Ala Trp Ser Gly His Ser 
290 295 300 

gcc cgt gtc gga gcc gcg cga gat atg gcc cgc gct gga gtt tca ata 960 
Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser Ile 
305 310 315 320 

ccg gag atc atg caa gct ggt ggc tgg acc aat gta aat att gtc atg 1008 
Pro Glu Ile Met Gln Ala Gly Gly Trp Thr Asn Val Asn Ile Val Met 
325 330 335 

aac tat atc cgt aac ctg gat agt gaa aca ggg gca atg gtg cgc ctg 1056 
Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu 
340 345 350 

ctg gaa gat ggc gat ggt atc gaa ggt cgt ggt agc gct tgg cgt cac 1104 
Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg Gly Ser Ala Trp Arg His 
355 360 365 

ccg cag ttc ggt ggt taataa 1125 
Pro Gln Phe Gly Gly 
370 



<210> 19 
<211> 373 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein TATcreStrepTag 

<400> 19 

Met Gly Tyr Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg Gly Met Ser 
1 5 10 15 

Asn Leu Leu Thr Val His Gln Asn Leu Pro Ala Leu Pro Val Asp Ala 
20 25 30 

Thr Ser Asp Glu Val Arg Lys Asn Leu Met Asp Met Phe Arg Asp Arg 
35 40 45 

Gln Ala Phe Ser Glu His Thr Trp Lys Met Leu Leu Ser Val Cys Arg 
50 55 60 

Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn Arg Lys Trp Phe Pro Ala 
65 70 75 80 

Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu Tyr Leu Gln Ala Arg Gly 
85 90 95 

Leu Thr Val Lys Thr Ile Gln Gln His Leu Gly Gln Leu Asn Met Leu 
100 105 110 

His Arg Arg Ser Gly Leu Pro Arg Pro Ser Asp Ser Asn Ala Val Ser 
115 120 125 

Leu Val Met Arg Arg Ile Arg Lys Glu Asn Val Asp Ala Gly Glu Arg 
130 135 140 

Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr Asp Phe Asp Gln Val Arg 
145 150 155 160 

Ser Leu Met Glu Asn Ser Asp Arg Cys Gln Asp Ile Arg Asn Leu Ala 
165 170 175 

Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu Arg Ile Ala Glu Ile Ala 
180 185 190 

Arg Ile Arg Val Lys Asp Ile Ser Arg Thr Asp Gly Gly Arg Met Leu 
195 200 205 

Ile His Ile Gly Arg Thr Lys Thr Leu Val Ser Thr Ala Gly Val Glu 
210 215 220 

Lys Ala Leu Ser Leu Gly Val Thr Lys Leu Val Glu Arg Trp Ile Ser 
225 230 235 240 

Val Ser Gly Val Ala Asp Asp Pro Asn Asn Tyr Leu Phe Cys Arg Val 
245 250 255 

Arg Lys Asn Gly Val Ala Ala Pro Ser Ala Thr Ser Gln Leu Ser Thr 
260 265 270 

Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr His Arg Leu Ile Tyr Gly 
275 280 285 

Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu Ala Trp Ser Gly His Ser 
290 295 300 

Ala Arg Val Gly Ala Ala Arg Asp Met Ala Arg Ala Gly Val Ser Ile 
305 310 315 320 

Pro Glu Ile Met Gln Ala Gly Gly Trp Thr Asn Val Asn Ile Val Met 
325 330 335 

Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr Gly Ala Met Val Arg Leu 
340 345 350 

Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg Gly Ser Ala Trp Arg His 
355 360 365 

Pro Gln Phe Gly Gly 
370 



<210> 20 
<211> 2055 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22creStrepTag 

<220> 

<221> CDS 

<222> (1)..(2049) 

<400> 20 

atg acc tct cgc cgc tcc gtg aag tcg ggt ccg cgg gag gtt ccg cgc 48 
Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 
1 5 10 15 

gat gag tac gag gat ctg tac tac acc ccg tct tca ggt atg gcg agt 96 
Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
20 25 30 

ccc gat agt ccg cct gac acc tcc cgc cgt ggc gcc cta cag aca cgc 144 
Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg 
35 40 45 

tcg cgc cag agg ggc gag gtc cgt ttc gtc cag tac gac gag tcg gat 192 
Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp 
50 55 60 

tat gcc ctc tac ggg ggc tcg tct tcc gaa gac gac gaa cac ccg gag 240 
Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

gtc ccc cgg acg cgg cgt ccc gtt tcc ggg gcg gtt ttg tcc ggc ccg 288 
Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
85 90 95 

ggg cct gcg cgg gcg cct ccg cca ccc gct ggg tcc gga ggg gcc gga 336 
Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 
100 105 110 

cgc aca ccc acc acc gcc ccc cgg gcc ccc cga acc cag cgg gtg gcg 384 
Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala 
115 120 125 

tct aag gcc ccc gcg gcc ccg gcg gcg gag acc acc cgc ggc agg aaa 432 
Ser Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys 
130 135 140 

tcg gcc cag cca gaa tcc gcc gca ctc cca gac gcc ccc gcg tcg acg 480 
Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr 
145 150 155 160 

gcg cca acc cga tcc aag aca ccc gcg cag ggg ctg gcc aga aag ctg 528 
Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu 
165 170 175 

cac ttt agc acc gcc ccc cca aac ccc gac gcg cca tgg acc ccc cgg 576 
His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
180 185 190 

gtg gcc ggc ttt aac aag cgc gtc ttc tgc gcc gcg gtc ggg cgc ctg 624 
Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 
195 200 205 

gcg gcc atg cat gcc cgg atg gcg gct gtc cag ctc tgg gac atg tcg 672 
Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser 
210 215 220 

cgt ccg cgc aca gac gaa gac ctc aac gaa ctc ctt ggc atc acc acc 720 
Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr 
225 230 235 240 

atc cgc gtg acg gtc tgc gag ggc aaa aac ctg ctt cag cgc gcc aac 768 
Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn 
245 250 255 

gag ttg gtg aat cca gac gtg gtg cag gac gtc gac gcg gcc acg gcg 816 
Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala 
260 265 270 

act cga ggg cgt tct gcg gcg tcg cgc ccc acc gag cga cct cga gcc 864 
Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 
275 280 285 

cca gcc cgc tcc gct tct cgc ccc aga cgg ccc gtc gag ggt acc gag 912 
Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
290 295 300 

ctc gga tcc act agt cca gtg tgg tgg aat tct gca gat atc cag cac 960 
Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His 
305 310 315 320 

agt ggc ggc cgc atg tcc aat tta ctg acc gta cac caa aat ttg cct 1008 
Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro 
325 330 335 

gca tta ccg gtc gat gca acg agt gat gag gtt cgc aag aac ctg atg 1056 
Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met 
340 345 350 

gac atg ttc agg gat cgc cag gcg ttt tct gag cat acc tgg aaa atg 1104 
Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met 
355 360 365 

ctt ctg tcc gtt tgc cgg tcg tgg gcg gca tgg tgc aag ttg aat aac 1152 
Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn 
370 375 380 

cgg aaa tgg ttt ccc gca gaa cct gaa gat gtt cgc gat tat ctt cta 1200 
Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu 
385 390 395 400 

tat ctt cag gcg cgc ggt ctg gca gta aaa act atc cag caa cat ttg 1248 
Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu 
405 410 415 

ggc cag cta aac atg ctt cat cgt cgg tcc ggg ctg cca cga cca agt 1296 
Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser 
420 425 430 

gac agc aat gct gtt tca ctg gtt atg cgg cgg atc cga aaa gaa aac 1344 
Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn 
435 440 445 

gtt gat gcc ggt gaa cgt gca aaa cag gct cta gcg ttc gaa cgc act 1392 
Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr 
450 455 460 

gat ttc gac cag gtt cgt tca ctc atg gaa aat agc gat cgc tgc cag 1440 
Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln 
465 470 475 480 

gat ata cgt aat ctg gca ttt ctg ggg att gct tat aac acc ctg tta 1488 
Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu 
485 490 495 

cgt ata gcc gaa att gcc agg atc agg gtt aaa gat atc tca cgt act 1536 
Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr 
500 505 510 

gac ggt ggg aga atg tta atc cat att ggc aga acg aaa acg ctg gtt 1584 
Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val 
515 520 525 

agc acc gca ggt gta gag aag gca ctt agc ctg ggg gta act aaa ctg 1632 
Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu 
530 535 540 

gtc gag cga tgg att tcc gtc tct ggt gta gct gat gat ccg aat aac 1680 
Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn 
545 550 555 560 

tac ctg ttt tgc cgg gtc aga aaa aat ggt gtt gcc gcg cca tct gcc 1728 
Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala 
565 570 575 

acc agc cag cta tca act cgc gcc ctg gaa ggg att ttt gaa gca act 1776 
Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr 
580 585 590 

cat cga ttg att tac ggc gct aag gat gac tct ggt cag aga tac ctg 1824 
His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu 
595 600 605 

gcc tgg tct gga cac agt gcc cgt gtc gga gcc gcg cga gat atg gcc 1872 
Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala 
610 615 620 

cgc gct gga gtt tca ata ccg gag atc atg caa gct ggt ggc tgg acc 1920 
Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr 
625 630 635 640 

aat gta aat att gtc atg aac tat atc cgt aac ctg gat agt gaa aca 1968 
Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr 
645 650 655 

ggg gca atg gtg cgc ctg ctg gaa gat ggc gat ggt atc gaa ggt cgt 2016 
Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg 
660 665 670 

ggt agc gct tgg cgt cac ccg cag ttc ggt ggt taataa 2055 
Gly Ser Ala Trp Arg His Pro Gln Phe Gly Gly 
675 680 

<210> 21 
<211> 683 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: DNA sequence 
coding for a fusion protein VP22creStrepTag 

<400> 21 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 
1 5 10 15 

Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
20 25 30 

Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gln Thr Arg 
35 40 45 

Ser Arg Gln Arg Gly Glu Val Arg Phe Val Gln Tyr Asp Glu Ser Asp 
50 55 60 

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
85 90 95 

Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 
100 105 110 

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gln Arg Val Ala 
115 120 125 

Ser Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys 
130 135 140 

Ser Ala Gln Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr 
145 150 155 160 

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gln Gly Leu Ala Arg Lys Leu 
165 170 175 

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
180 185 190 

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 
195 200 205 

Ala Ala Met His Ala Arg Met Ala Ala Val Gln Leu Trp Asp Met Ser 
210 215 220 

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly Ile Thr Thr 
225 230 235 240 

Ile Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gln Arg Ala Asn 
245 250 255 

Glu Leu Val Asn Pro Asp Val Val Gln Asp Val Asp Ala Ala Thr Ala 
260 265 270 

Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 
275 280 285 

Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu Gly Thr Glu 
290 295 300 

Leu Gly Ser Thr Ser Pro Val Trp Trp Asn Ser Ala Asp Ile Gln His 
305 310 315 320 

Ser Gly Gly Arg Met Ser Asn Leu Leu Thr Val His Gln Asn Leu Pro 
325 330 335 

Ala Leu Pro Val Asp Ala Thr Ser Asp Glu Val Arg Lys Asn Leu Met 
340 345 350 

Asp Met Phe Arg Asp Arg Gln Ala Phe Ser Glu His Thr Trp Lys Met 
355 360 365 

Leu Leu Ser Val Cys Arg Ser Trp Ala Ala Trp Cys Lys Leu Asn Asn 
370 375 380 

Arg Lys Trp Phe Pro Ala Glu Pro Glu Asp Val Arg Asp Tyr Leu Leu 
385 390 395 400 

Tyr Leu Gln Ala Arg Gly Leu Ala Val Lys Thr Ile Gln Gln His Leu 
405 410 415 

Gly Gln Leu Asn Met Leu His Arg Arg Ser Gly Leu Pro Arg Pro Ser 
420 425 430 

Asp Ser Asn Ala Val Ser Leu Val Met Arg Arg Ile Arg Lys Glu Asn 
435 440 445 

Val Asp Ala Gly Glu Arg Ala Lys Gln Ala Leu Ala Phe Glu Arg Thr 
450 455 460 

Asp Phe Asp Gln Val Arg Ser Leu Met Glu Asn Ser Asp Arg Cys Gln 
465 470 475 480 

Asp Ile Arg Asn Leu Ala Phe Leu Gly Ile Ala Tyr Asn Thr Leu Leu 
485 490 495 

Arg Ile Ala Glu Ile Ala Arg Ile Arg Val Lys Asp Ile Ser Arg Thr 
500 505 510 

Asp Gly Gly Arg Met Leu Ile His Ile Gly Arg Thr Lys Thr Leu Val 
515 520 525 

Ser Thr Ala Gly Val Glu Lys Ala Leu Ser Leu Gly Val Thr Lys Leu 
530 535 540 

Val Glu Arg Trp Ile Ser Val Ser Gly Val Ala Asp Asp Pro Asn Asn 
545 550 555 560 

Tyr Leu Phe Cys Arg Val Arg Lys Asn Gly Val Ala Ala Pro Ser Ala 
565 570 575 

Thr Ser Gln Leu Ser Thr Arg Ala Leu Glu Gly Ile Phe Glu Ala Thr 
580 585 590 

His Arg Leu Ile Tyr Gly Ala Lys Asp Asp Ser Gly Gln Arg Tyr Leu 
595 600 605 

Ala Trp Ser Gly His Ser Ala Arg Val Gly Ala Ala Arg Asp Met Ala 
610 615 620 

Arg Ala Gly Val Ser Ile Pro Glu Ile Met Gln Ala Gly Gly Trp Thr 
625 630 635 640 

Asn Val Asn Ile Val Met Asn Tyr Ile Arg Asn Leu Asp Ser Glu Thr 
645 650 655 

Gly Ala Met Val Arg Leu Leu Glu Asp Gly Asp Gly Ile Glu Gly Arg 
660 665 670 

Gly Ser Ala Trp Arg His Pro Gln Phe Gly Gly 
675 680 



<210> 22 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 22 

Ala Gly Arg Lys Lys Arg Arg Gln Arg Arg Arg 
1 5 10 



<210> 23 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 23 

Tyr Ala Arg Lys Ala Arg Arg Gln Ala Arg Arg 
1 5 10 



<210> 24 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 24 

Tyr Ala Arg Ala Ala Ala Arg Gln Ala Arg Ala 
1 5 10 



<210> 25 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 25 

Tyr Ala Arg Ala Ala Arg Arg Ala Ala Arg Arg 
1 5 10 



<210> 26 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 26 

Tyr Ala Arg Ala Ala Arg Arg Ala Ala Arg Ala 
1 5 10 



<210> 27 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 27 

Tyr Ala Arg Arg Arg Arg Arg Arg Arg Arg Arg 
1 5 10 



<210> 28 
<211> 11 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: synthetic TAT 
protein 

<400> 28 

Tyr Ala Ala Ala Arg Arg Arg Arg Arg Arg Arg 
1 5 10 



<210> 29 
<211> 4960 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pCMV-I-Cre-pA 

<400> 29 

aaacagtccg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 60 

tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 120 

cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 180 

atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 240 

tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 300 

cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 360 

tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg 420 

cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt 480 

ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca 540 

aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag 600 

gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 660 

attaatacga ctcactatag ggagacccaa gctgactcta gacttaatta agcgttgggg 720 

tgagtactcc ctctcaaaag cgggcatgac ttctgcgcta agattgtcag tttccaaaaa 780 

cgaggaggat ttgatattca cctggcccgc ggtgatgcct ttgagggtgg ccgcgtccat 840 

ctggtcagaa aagacaatct ttttgttgtc aagcttgagg tgtggcaggc ttgagatctg 900 

gccatacact tgagtgacat tgacatccac tttgcctttc tctccacagg tgtccactcc 960 

cagggcggcc tcgaccatgc ccaagaagaa gaggaaggtg tccaatttac tgaccgtaca 1020 

ccaaaatttg cctgcattac cggtcgatgc aacgagtgat gaggttcgca agaacctgat 1080 

ggacatgttc agggatcgcc aggcgttttc tgagcatacc tggaaaatgc ttctgtccgt 1140 

ttgccggtcg tgggcggcat ggtgcaagtt gaataaccgg aaatggtttc ccgcagaacc 1200 

tgaagatgtt cgcgattatc ttctatatct tcaggcgcgc ggtctggcag taaaaactat 1260 

ccagcaacat ttgggccagc taaacatgct tcatcgtcgg tccgggctgc cacgaccaag 1320 

tgacagcaat gctgtttcac tggttatgcg gcggatccga aaagaaaacg ttgatgccgg 1380 

tgaacgtgca aaacaggctc tagcgttcga acgcactgat ttcgaccagg ttcgttcact 1440 

catggaaaat agcgatcgct gccaggatat acgtaatctg gcatttctgg ggattgctta 1500 

taacaccctg ttacgtatag ccgaaattgc caggatcagg gttaaagata tctcacgtac 1560 

tgacggtggg agaatgttaa tccatattgg cagaacgaaa acgctggtta gcaccgcagg 1620 

tgtagagaag gcacttagcc tgggggtaac taaactggtc gagcgatgga tttccgtctc 1680 

tggtgtagct gatgatccga ataactacct gttttgccgg gtcagaaaaa atggtgttgc 1740 

cgcgccatct gccaccagcc agctatcaac tcgcgccctg gaagggattt ttgaagcaac 1800 

tcatcgattg atttacggcg ctaaggatga ctctggtcag agatacctgg cctggtctgg 1860 

acacagtgcc cgtgtcggag ccgcgcgaga tatggcccgc gctggagttt caataccgga 1920 

gatcatgcaa gctggtggct ggaccaatgt aaatattgtc atgaactata tccgtaacct 1980 

ggatagtgaa acaggggcaa tggtgcgcct gctggaagat ggcgattagc cattaacgcg 2040 

taaatgattg cagatccact agttctaggg ccgcgtcgac ctcgagatcc aggcgcggat 2100 

caataaaaga tcattatttt caatagatct gtgtgttggt tttttgtgtg ccttggggga 2160 

gggggaggcc agaatgaggc gcggccaagg gggaggggga ggccagaatg accttggggg 2220 

agggggaggc cagaatgacc ttgggggagg gggaggccag aatgaggcgc gcccccgggt 2280 

accgagctcg aattcactgg ccgtcgtttt acaacgtcgt gactgggaaa accctggcgt 2340 

tacccaactt aatcgccttg cagcacatcc ccctttcgcc agctggcgta atagcgaaga 2400 

ggcccgcacc gatcgccctt cccaacagtt gcgcagcctg aatggcgaat ggcgcctgat 2460 

gcggtatttt ctccttacgc atctgtgcgg tatttcacac cgcatatggt gcactctcag 2520 

tacaatctgc tctgatgccg catagttaag ccagccccga cacccgccaa cacccgctga 2580 
cgcgccctga cgggcttgtc tgctcccggc atccgcttac agacaagctg tgaccgtctc 2640 
cgggagctgc atgtgtcaga ggttttcacc gtcatcaccg aaacgcgcga gacgaaaggg 2700 
cctcgtgata cgcctatttt tataggttaa tgtcatgata ataatggttt cttagacgtc 2760 
aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 2820 
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 2880 
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 2940 
ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 3000 
gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 3060 
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 3120 
ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 3180 
gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt 3240 
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 3300 
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 3360 
aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga 3420 
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 3480 
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 3540 
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 3600 
gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 3660 
agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 3720 
gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 3780 
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 3840 
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 3900 
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 3960 
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 4020 
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta 4080 
gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 4140 
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 4200 
aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 4260 
gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 4320 
aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 4380 
aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 4440 
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 4500 
cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 4560 
tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 4620 
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 4680 
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 4740 
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 4800 
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 4860 
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 4920 
cgccaagcta gcccgggcta gcttgcatgc ctgcaggttt 4960 



<210> 30 
<211> 7332 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pCMV-I-beta-pA 



<400> 30 

aaacagtccg atgtacgggc cagatatacg cgttgacatt gattattgac tagttattaa 60 
tagtaatcaa ttacggggtc attagttcat agcccatata tggagttccg cgttacataa 120 
cttacggtaa atggcccgcc tggctgaccg cccaacgacc cccgcccatt gacgtcaata 180 
atgacgtatg ttcccatagt aacgccaata gggactttcc attgacgtca atgggtggac 240 
tatttacggt aaactgccca cttggcagta catcaagtgt atcatatgcc aagtacgccc 300 
cctattgacg tcaatgacgg taaatggccc gcctggcatt atgcccagta catgacctta 360 
tgggactttc ctacttggca gtacatctac gtattagtca tcgctattac catggtgatg 420 
cggttttggc agtacatcaa tgggcgtgga tagcggtttg actcacgggg atttccaagt 480 
ctccacccca ttgacgtcaa tgggagtttg ttttggcacc aaaatcaacg ggactttcca 540 
aaatgtcgta acaactccgc cccattgacg caaatgggcg gtaggcgtgt acggtgggag 600 
gtctatataa gcagagctct ctggctaact agagaaccca ctgcttactg gcttatcgaa 660 
attaatacga ctcactatag ggagacccaa gctgactcta gacttaatta agcgttgggg 720 
tgagtactcc ctctcaaaag cgggcatgac ttctgcgcta agattgtcag tttccaaaaa 780 
cgaggaggat ttgatattca cctggcccgc ggtgatgcct ttgagggtgg ccgcgtccat 840 
ctggtcagaa aagacaatct ttttgttgtc aagcttgagg tgtggcaggc ttgagatctg 900 
gccatacact tgagtgacat tgacatccac tttgcctttc tctccacagg tgtccactcc 960 
cagggcggcc gcaattcccg gggatcgaaa gagcctgcta aagcaaaaaa gaagtcacca 1020 
tgtcgtttac tttgaccaac aagaacgtga ttttcgttgc cggtctggga ggcattggtc 1080 
tggacaccag caaggagctg ctcaagcgcg atcccgtcgt tttacaacgt cgtgactggg 1140 
aaaaccctgg cgttacccaa cttaatcgcc ttgcagcaca tccccctttc gccagctggc 1200 
gtaatagcga agaggcccgc accgatcgcc cttcccaaca gttgcgcagc ctgaatggcg 1260 
aatggcgctt tgcctggttt ccggcaccag aagcggtgcc ggaaagctgg ctggagtgcg 1320 
atcttcctga ggccgatact gtcgtcgtcc cctcaaactg gcagatgcac ggttacgatg 1380 
cgcccatcta caccaacgta acctatccca ttacggtcaa tccgccgttt gttcccacgg 1440 
agaatccgac gggttgttac tcgctcacat ttaatgttga tgaaagctgg ctacaggaag 1500 
gccagacgcg aattattttt gatggcgtta actcggcgtt tcatctgtgg tgcaacgggc 1560 
gctgggtcgg ttacggccag gacagtcgtt tgccgtctga atttgacctg agcgcatttt 1620 
tacgcgccgg agaaaaccgc ctcgcggtga tggtgctgcg ttggagtgac ggcagttatc 1680 
tggaagatca ggatatgtgg cggatgagcg gcattttccg tgacgtctcg ttgctgcata 1740 
aaccgactac acaaatcagc gatttccatg ttgccactcg ctttaatgat gatttcagcc 1800 
gcgctgtact ggaggctgaa gttcagatgt gcggcgagtt gcgtgactac ctacgggtaa 1860 
cagtttcttt atggcagggt gaaacgcagg tcgccagcgg caccgcgcct ttcggcggtg 1920 
aaattatcga tgagcgtggt ggttatgccg atcgcgtcac actacgtctg aacgtcgaaa 1980 
acccgaaact gtggagcgcc gaaatcccga atctctatcg tgcggtggtt gaactgcaca 2040 
ccgccgacgg cacgctgatt gaagcagaag cctgcgatgt cggtttccgc gaggtgcgga 2100 
ttgaaaatgg tctgctgctg ctgaacggca agccgttgct gattcgaggc gttaaccgtc 2160 
acgagcatca tcctctgcat ggtcaggtca tggatgagca gacgatggtg caggatatcc 2220 
tgctgatgaa gcagaacaac tttaacgccg tgcgctgttc gcattatccg aaccatccgc 2280 
tgtggtacac gctgtgcgac cgctacggcc tgtatgtggt ggatgaagcc aatattgaaa 2340 
cccacggcat ggtgccaatg aatcgtctga ccgatgatcc gcgctggcta ccggcgatga 2400 
gcgaacgcgt aacgcgaatg gtgcagcgcg atcgtaatca cccgagtgtg atcatctggt 2460 
cgctggggaa tgaatcaggc cacggcgcta atcacgacgc gctgtatcgc tggatcaaat 2520 
ctgtcgatcc ttcccgcccg gtgcagtatg aaggcggcgg agccgacacc acggccaccg 2580 
atattatttg cccgatgtac gcgcgcgtgg atgaagacca gcccttcccg gctgtgccga 2640 
aatggtccat caaaaaatgg ctttcgctac ctggagagac gcgcccgctg atcctttgcg 2700 
aatacgccca cgcgatgggt aacagtcttg gcggtttcgc taaatactgg caggcgtttc 2760 
gtcagtatcc ccgtttacag ggcggcttcg tctgggactg ggtggatcag tcgctgatta 2820 
aatatgatga aaacggcaac ccgtggtcgg cttacggcgg tgattttggc gatacgccga 2880 
acgatcgcca gttctgtatg aacggtctgg tctttgccga ccgcacgccg catccagcgc 2940 
tgacggaagc aaaacaccag cagcagtttt tccagttccg tttatccggg caaaccatcg 3000 
aagtgaccag cgaatacctg ttccgtcata gcgataacga gctcctgcac tggatggtgg 3060 
cgctggatgg taagccgctg gcaagcggtg aagtgcctct ggatgtcgct ccacaaggta 3120 
aacagttgat tgaactgcct gaactaccgc agccggagag cgccgggcaa ctctggctca 3180 
cagtacgcgt agtgcaaccg aacgcgaccg catggtcaga agccgggcac atcagcgcct 3240 
ggcagcagtg gcgtctggcg gaaaacctca gtgtgacgct ccccgccgcg tcccacgcca 3300 
tcccgcatct gaccaccagc gaaatggatt tttgcatcga gctgggtaat aagcgttggc 3360 
aatttaaccg ccagtcaggc tttctttcac agatgtggat tggcgataaa aaacaactgc 3420 
tgacgccgct gcgcgatcag ttcacccgtg caccgctgga taacgacatt ggcgtaagtg 3480 
aagcgacccg cattgaccct aacgcctggg tcgaacgctg gaaggcggcg ggccattacc 3540 
aggccgaagc agcgttgttg cagtgcacgg cagatacact tgctgatgcg gtgctgatta 3600 
cgaccgctca cgcgtggcag catcagggga aaaccttatt tatcagccgg aaaacctacc 3660 
ggattgatgg tagtggtcaa atggcgatta ccgttgatgt tgaagtggcg agcgatacac 3720 
cgcatccggc gcggattggc ctgaactgcc agctggcgca ggtagcagag cgggtaaact 3780 
ggctcggatt agggccgcaa gaaaactatc ccgaccgcct tactgccgcc tgttttgacc 3840 
gctgggatct gccattgtca gacatgtata ccccgtacgt cttcccgagc gaaaacggtc 3900 
tgcgctgcgg gacgcgcgaa ttgaattatg gcccacacca gtggcgcggc gacttccagt 3960 
tcaacatcag ccgctacagt caacagcaac tgatggaaac cagccatcgc catctgctgc 4020 
acgcggaaga aggcacatgg ctgaatatcg acggtttcca tatggggatt ggtggcgacg 4080 
actcctggag cccgtcagta tcggcggaat tacagctgag cgccggtcgc taccattacc 4140 
agttggtctg gtgtcaaaaa taataataac cgggcaggcc atgtctgccc gtatttcgcg 4200 
taaggaaatc cattatgtac tatttaaaaa acacaaactt ttggatgttc ggtttattct 4260 
ttttctttta cttttttatc atgggagcct acttcccgtt tttcccgatt tggctacatg 4320 
acatcaacca tatcagcaaa agtgatacgg gtattatttt tgccgctatt tctctgttct 4380 
cgctattatt ccaaccgctg tttggtctgc tttctgacaa actcggcctc gactctaggc 4440 
ggccgcgtcg acctcgagat ccaggcgcgg atcaataaaa gatcattatt ttcaatagat 4500 
ctgtgtgttg gttttttgtg tgccttgggg gagggggagg ccagaatgag gcgcggccaa 4560 
gggggagggg gaggccagaa tgaccttggg ggagggggag gccagaatga ccttggggga 4620 
gggggaggcc agaatgaggc gcgcccccgg gtaccgagct cgaattcact ggccgtcgtt 4680 
ttacaacgtc gtgactggga aaaccctggc gttacccaac ttaatcgcct tgcagcacat 4740 
ccccctttcg ccagctggcg taatagcgaa gaggcccgca ccgatcgccc ttcccaacag 4800 
ttgcgcagcc tgaatggcga atggcgcctg atgcggtatt ttctccttac gcatctgtgc 4860 
ggtatttcac accgcatatg gtgcactctc agtacaatct gctctgatgc cgcatagtta 4920 
agccagcccc gacacccgcc aacacccgct gacgcgccct gacgggcttg tctgctcccg 4980 
gcatccgctt acagacaagc tgtgaccgtc tccgggagct gcatgtgtca gaggttttca 5040 
ccgtcatcac cgaaacgcgc gagacgaaag ggcctcgtga tacgcctatt tttataggtt 5100 
aatgtcatga taataatggt ttcttagacg tcaggtggca cttttcgggg aaatgtgcgc 5160 
ggaaccccta tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa 5220 
taaccctgat aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc 5280 
cgtgtcgccc ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa 5340 
acgctggtga aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa 5400 
ctggatctca acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg 5460 
atgagcactt ttaaagttct gctatgtggc gcggtattat cccgtattga cgccgggcaa 5520 
gagcaactcg gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc 5580 
acagaaaagc atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc 5640 
atgagtgata acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta 5700 
accgcttttt tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag 5760 
ctgaatgaag ccataccaaa cgacgagcgt gacaccacga tgcctgtagc aatggcaaca 5820 
acgttgcgca aactattaac tggcgaacta cttactctag cttcccggca acaattaata 5880 
gactggatgg aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc 5940 
tggtttattg ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca 6000 
ctggggccag atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca 6060 
actatggatg aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg 6120 
taactgtcag accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa 6180 
tttaaaagga tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt 6240 
gagttttcgt tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat 6300 
cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg 6360 
gtttgtttgc cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga 6420 
gcgcagatac caaatactgt ccttctagtg tagccgtagt taggccacca cttcaagaac 6480 
tctgtagcac cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt 6540 
ggcgataagt cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag 6600 
cggtcgggct gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc 6660 
gaactgagat acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag 6720 
gcggacaggt atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca 6780 
gggggaaacg cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt 6840 
cgatttttgt gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc 6900 
tttttacggt tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc 6960 
cctgattctg tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc 7020 
cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg aagagcgccc aatacgcaaa 7080 
ccgcctctcc ccgcgcgttg gccgattcat taatgcagct ggcacgacag gtttcccgac 7140 
tggaaagcgg gcagtgagcg caacgcaatt aatgtgagtt agctcactca ttaggcaccc 7200 
caggctttac actttatgct tccggctcgt atgttgtgtg gaattgtgag cggataacaa 7260 
tttcacacag gaaacagcta tgaccatgat tacgccaagc tagcccgggc tagcttgcat 7320 
gcctgcaggt tt 7332 



<210> 31 
<211> 72 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 31 

atgccatggg ctacggccgc aagaagcgcc gccaacgccg ccgcggcatg tccaatttac 60 
tgaccgtaca cc 72 



<210> 32 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 32 

tttcggatcc gccgcataac cagtg 

<210> 33 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 33 

tatatctaga ccatgggcta cggccgcaag aagc 



<210> 34 
<211> 43 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 34 

gctaccacga ccttcgatac catcgccatc ttccagcagg cgc 

<210> 35 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 35 

taactagcgg ccgcatgtcc aatttactga ccgtacac 



<210> 36 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 36 

tcgagcggcc gccatcgcca tcttccagca ggcg 

<210> 37 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 

<400> 37 

tatatctaga catatgacct ctcgccgctc cg 32 



<210> 38 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 38 

ttccgaagac gacgaaacac c 21 



<210> 39 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 



<400> 39 

tatattcgaa gcttattaac caccgaactg cg 32 



<210> 40 
<211> 4847 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: vector 
pGK-cre-pA 



<400> 40 

aggtggcact tttcggggaa atgtgcgcgg aacccctatt tgtttatttt tctaaataca 60 
ttcaaatatg tatccgctca tgagacaata accctgataa atgcttcaat aatattgaaa 120 
aaggaagagt atgagtattc aacatttccg tgtcgccctt attccctttt ttgcggcatt 180 
ttgccttcct gtttttgctc acccagaaac gctggtgaaa gtaaaagatg ctgaagatca 240 
gttgggtgca cgagtgggtt acatcgaact ggatctcaac agcggtaaga tccttgagag 300 
ttttcgcccc gaagaacgtt ttccaatgat gagcactttt aaagttctgc tatgtggcgc 360 
ggtattatcc cgtattgacg ccgggcaaga gcaactcggt cgccgcatac actattctca 420 
gaatgacttg gttgagtact caccagtcac agaaaagcat cttacggatg gcatgacagt 480 
aagagaatta tgcagtgctg ccataaccat gagtgataac actgcggcca acttacttct 540 
gacaacgatc ggaggaccga aggagctaac cgcttttttg cacaacatgg gggatcatgt 600 
aactcgcctt gatcgttggg aaccggagct gaatgaagcc ataccaaacg acgagcgtga 660 
caccacgatg cctgtagcaa tggcaacaac gttgcgcaaa ctattaactg gcgaactact 720 
tactctagct tcccggcaac aattaataga ctggatggag gcggataaag ttgcaggacc 780 
acttctgcgc tcggcccttc cggctggctg gtttattgct gataaatctg gagccggtga 840 
gcgtgggtct cgcggtatca ttgcagcact ggggccagat ggtaagccct cccgtatcgt 900 
agttatctac acgacgggga gtcaggcaac tatggatgaa cgaaatagac agatcgctga 960 
gataggtgcc tcactgatta agcattggta actgtcagac caagtttact catatatact 1020 
ttagattgat ttaaaacttc atttttaatt taaaaggatc taggtgaaga tcctttttga 1080 
taatctcatg accaaaatcc cttaacgtga gttttcgttc cactgagcgt cagaccccgt 1140 
agaaaagatc aaaggatctt cttgagatcc tttttttctg cgcgtaatct gctgcttgca 1200 
aacaaaaaaa ccaccgctac cagcggtggt ttgtttgccg gatcaagagc taccaactct 1260 
ttttccgaag gtaactggct tcagcagagc gcagatacca aatactgtcc ttctagtgta 1320 
gccgtagtta ggccaccact tcaagaactc tgtagcaccg cctacatacc tcgctctgct 1380 
aatcctgtta ccagtggctg ctgccagtgg cgataagtcg tgtcttaccg ggttggactc 1440 
aagacgatag ttaccggata aggcgcagcg gtcgggctga acggggggtt cgtgcacaca 1500 
gcccagcttg gagcgaacga cctacaccga actgagatac ctacagcgtg agctatgaga 1560 
aagcgccacg cttcccgaag ggagaaaggc ggacaggtat ccggtaagcg gcagggtcgg 1620 
aacaggagag cgcacgaggg agcttccagg gggaaacgcc tggtatcttt atagtcctgt 1680 
cgggtttcgc cacctctgac ttgagcgtcg atttttgtga tgctcgtcag gggggcggag 1740 
cctatggaaa aacgccagca acgcggcctt tttacggttc ctggcctttt gctggccttt 1800 
tgctcacatg ttctttcctg cgttatcccc tgattctgtg gataaccgta ttaccgcctt 1860 
tgagtgagct gataccgctc gccgcagccg aacgaccgag cgcagcgagt cagtgagcga 1920 
ggaagcggaa gagcgcccaa tacgcaaacc gcctctcccc gcgcgttggc cgattcatta 1980 
atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca acgcaattaa 2040 
tgtgagttag ctcactcatt aggcacccca ggctttacac tttatgcttc cggctcgtat 2100 
gttgtgtgga attgtgagcg gataacaatt tcacacagga aacagctatg accatgatta 2160 
cgccaagcgc gcaattaacc ctcactaaag ggaacaaaag ctgggtaccg ggccccccct 2220 
cgaggtcgac ggtatcgata agcttgatat cgaattctac cgggtagggg aggcgctttt 2280 
cccaaggcag tctggagcat gcgctttagc agccccgctg gcacttggcg ctacacaagt 2340 
ggcctctggc ctcgcacaca ttccacatcc accggtagcg ccaaccggct ccgttctttg 2400 
gtggcccctt cgcgccactt ctactcctcc cctagtcagg aagtttcccc cagcaagctc 2460 
gcgtcgtgca ggacgtgaca aatggaagta gcacgtctca ctagtctcgt gcagatggac 2520 
agcaccgctg agcaatggaa gcgggtaggc ctttggggca gcggccaata gcagctttgt 2580 
tccttcgctt tctgggctca gaggctggga aggggtgggt ccgggggcgg gctcaggggc 2640 
qqqj££oa.qqq gcgggcgggc gcccgaaggt cetceegagg cccggcairtc tgcacgcttc 2700 
aaaagcgcac gtctgccgcg ctgttctcct cttcctcatc tccgggcctt tcgacctgca 2760 
gctcgaggtc gaccatgccc aagaagaaga ggaaggtgtc caatttactg accgtacacc 2820 
aaaatttgcc tgcattaccg gtcgatgcaa cgagtgatga ggttcgcaag aacctgatgg 2880 
acatgttcag ggatcgccag gcgttttctg agcatacctg gaaaatgctt ctgtccgttt 2940 
gccggtcgtg ggcggcatgg tgcaagttga ataaccggaa atggtttccc gcagaacctg 3000 
aagatgttcg cgattatctt ctatatcttc aggcgcgcgg tctggcagta aaaactatcc 3060 
agcaacattt gggccagcta aacatgcttc atcgtcggtc cgggctgcca cgaccaagtg 3120 
acagcaatgc tgtttcactg gttatgcggc ggatccgaaa agaaaacgtt gatgccggtg 3180 
aacgtgcaaa acaggctcta gcgttcgaac gcactgattt cgaccaggtt cgttcactca 3240 
tggaaaatag cgatcgctgc caggatatac gtaatctggc atttctgggg attgcttata 3300 
acaccctgtt acgtatagcc gaaattgcca ggatcagggt taaagatatc tcacgtactg 3360 
acggtgggag aatgttaatc catattggca gaacgaaaac gctggttagc accgcaggtg 3420 
tagagaaggc acttagcctg ggggtaacta aactggtcga gcgatggatt tccgtctctg 3480 
gtgtagctga tgatccgaat aactacctgt tttgccgggt cagaaaaaat ggtgttgccg 3540 
cgccatctgc caccagccag ctatcaactc gcgccctgga agggattttt gaagcaactc 3600 
atcgattgat ttacggcgct aaggatgact ctggtcagag atacctggcc tggtctggac 3660 
acagtgcccg tgtcggagcc gcgcgagata tggcccgcgc tggagtttca ataccggaga 3720 
tcatgcaagc tggtggctgg accaatgtaa atattgtcat gaactatatc cgtaacctgg 3780 
atagtgaaac aggggcaatg gtgcgcctgc tggaagatgg cgattagcca ttaacgcgta 3840 
aatgattgca gatccactag ttctagagct cgctgatcag cctcgactgt gccttctagt 3900 
tgccagccat ctgttgtttg cccctccccc gtgccttcct tgaccctgga aggtgccact 3960 
cccactgtcc tttcctaata aaatgaggaa attgcatcgc attgtctgag taggtgtcat 4020 
tctattctgg ggggtggggt ggggcaggac agcaaggggg aggattggga agacaatagc 4080 
aggcatgctg gggatgcggt gggctctatg gcttctgagn nngaaagaac cagctggggc 4140 
tcgagatcca ctagttctag cctcgaggct agagcggccg ccaccgcggt ggagctccaa 4200 
ttcgccctat agtgagtcgt attacgcgcg ctcactggcc gtcgttttac aacgtcgtga 4260 
ctgggaaaac cctggcgtta cccaacttaa tcgccttgca gcacatcccc ctttcgccag 4320 
ctggcgtaat agcgaagagg cccgcaccga tcgcccttcc caacagttgc gcagcctgaa 4380 
tggcgaatgg gacgcgccct gtagcggcgc attaagcgcg gcgggtgtgg tggttacgcg 4440 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 4500 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc tccctttagg 4560 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 4620 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 4680 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 4740 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 4800 
acaaaaattt aacgcgaatt ttaacaaaat attaacgctt acaattt 4847 

<210> 41 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 41 

catctccggg cctttcgacc tg 22 



<210> 42 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 42 

gcgatcggtg cgggcctctt c 21 

Claims 

1. Use of a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a protein transduction domain (PTD) 

for preparing an agent for inducing target gene alterations in a living 
organism or cell culture, wherein said living organism carries at least one 
or more recognition sites for said site-specific DNA recombinase integrated 
in an endogenous gene. 

2. The use of claim 1, wherein the PTD is not derived from Antennapedia 
and preferably is a PTD derived from the VP22 protein of HSV or from the 
TAT protein of HIV. 

3. Use of a fusion protein comprising 

(a) a site-specific DNA recombinase domain and 

(b) a protein transduction domain (PTD) being not derived from 
Antennapedia and preferably being derived from the VP22 protein of HSV 
or from the TAT protein of HIV 

for preparing an agent for inducing target gene alterations in a living 
organism or cell culture, wherein said living organism carries at least one 
or more recognition sites for said site-specific DNA recombinase integrated 
in its genome. 

4. The use of claim 3, wherein the recognition sites for said site-specific 
recombinase are present within an endogenous gene or a transgene. 

5. The use of any one of claims 2 to 4, wherein the TAT protein comprises 
(i) the amino acid sequence YGRKKRRQRRR (SEQ ID NO: 10) or a mutant 
thereof including 

(ii) peptides having the amino acid sequences 
AGRKKRRQRRR (SEQ ID NO: 22) 
YARKARRQARR (SEQ ID NO: 23) 
YARAAARQARA (SEQ ID NO: 24) 
YARAARRAARR (SEQ ID NO: 25) 
YARAARRAARA (SEQ ID NO: 26) 
YARRRRRRRRR (SEQ ID NO: 27) 
YAAARRRRRRR (SEQ ID NO: 28); 

preferably the TAT protein consists of one of the sequences shown in (i) or 
(ii) above. 

6. The use of any one of claims 2 to 4, wherein the VP22 protein 
comprises amino acids 16 to 157 of SEQ ID NO: 14. 

7. The use of any one of claims 1 to 6, wherein the site-specific DNA 
recombinase domain is selected from a recombinase protein derived from 
Cre, Flp, φC31 recombinase, and R recombinase and preferably is Cre 
having amino acids 15 to 357 of SEQ ID NO: 2 or Flpe having amino acids 
15 to 437 of SEQ ID NO: 4. 

8. The use of any one of claims 1 to 7, wherein the protein transduction 
domain is fused to the N-terminus of the site-specific DNA recombinase 
domain. 

9. The use of any one of claims 1 to 8, wherein the protein transduction 
domain is fused to the site-specific DNA recombinase domain through a 
direct chemical bond or through a linker molecule. 

10. The use of claim 9, wherein the linker molecule is a short 
peptide having 1 to 20, preferably 1 to 10 amino acid residues. 



WO 01/49832 PCT/EP01/00060 

11. The use of any one of claims 1 to 10, wherein said fusion protein 
further comprises additional functional sequences. 

12. The use of claim 1, wherein the fusion protein has the sequence 
shown in SEQ ID NOs: 2, 4, 6 or 8. 

13. The use of any one of claims 1 to 12, wherein the living organism is a 
vertebrate, preferably a rodent or a fish. 

14. A method for inducing gene alterations in a living organism which 
comprises administering to said living organism a fusion protein 
comprising a site-specific DNA recombinase domain and a protein 
transduction domain as defined in claims 1 to 12, wherein said living 
organism carries at least one or more recognition sites for said site- 
specific DNA recombinase integrated in its genome. 

15. A fusion protein comprising 

(a) a site-specific DNA recombinase domain as defined in claims 2 to 9 
and 

(b) a protein transduction domain (PTD) as defined in claims 2 to 9 
provided that when (a) is the wild-type Flp or Cre then (b) is not the full 
length VP22 protein of HSV. 

16. The fusion protein of claim 15, wherein the PTD is derived from the TAT 
protein of HIV. 

17. A DNA sequence coding for the fusion protein of claim 15 or 16, said 
DNA sequence preferably comprising the sequence shown in SEQ ID 
NOs: 1, 3, 5, 7, 9, 11, 13, 18 and/or 20. 



18. A vector comprising the DNA sequence of claim 17. 

19. A host cell transformed with the vector of claim 18 and/or comprising 
the DNA of claim 17. 

20. A method for producing the fusion protein of claim 15 which comprises 
culturing the transformed host cell of claim 19 and isolating the fusion 
protein. 

21. An injectable composition comprising the fusion protein as defined in 
claims 1 to 12 or 15 to 16. 

Fig. 1 

Figure 3 

Fig. 5 

Figure 6 

Fig. 7 (panel A: construct schematic labeled NdeI, ATG, VP22, Cre, HindIII) 

Figure 8 (gel images, lanes 1 to 10, labeled "Coomassie" and "α Strep tag") 

Fig. 10 